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Abstract 

Recently the second two authors characterized quasiperiodic Sturmian words, proving that 
a Sturmian word is non-quasiperiodic if and only if it is an infinite Lyndon word. Here we ex- 
tend this study to episturmian words (a natural generalization of Sturmian words) by describing 
all the quasiperiods of an episturmian word, which yields a characterization of quasiperiodic 
episturmian words in terms of their directive words. Even further, we establish a complete char- 
acterization of all episturmian words that are Lyndon words. Our main results show that, unlike 
the Sturmian case, there is a much wider class of episturmian words that are non-quasiperiodic, 
besides those that are infinite Lyndon words. Our key tools are morphisms and directive words, 
in particular normalized directive words, which we introduced in an earlier paper. Also of im- 
portance is the use of return words to characterize quasiperiodic episturmian words, since such 
a method could be useful in other contexts. 

Keywords: episturmian word; Sturmian word; Arnoux-Rauzy sequence; episturmian mor- 
phism; lexicographic order; infinite Lyndon word; quasiperiodicity. 

MSC (2000): 68R15. 

1 Introduction 

Sturmian words are a fascinating family of infinite words defined on a 2-letter alphabet which 
have been extensively studied since the pioneering work of Morse and Hedlund in 1940 (see |36j). 
Over the years, these infinite words have been shown to have numerous equivalent definitions and 
characterizations, and their beautiful properties are related to many fields like Number Theory, 
Geometry, Dynamical Systems, and Combinatorics on Words (see [Tl 1311 1581 1^1 f° r recent surveys). 

Many recent works have been devoted to generalizations of Sturmian words to arbitrary finite 
alphabets. An especially interesting generalization is the family of episturmian words, introduced 
by Droubay, Justin, and Pirillo in 2001 [12] (see also |23|, I25j for example). Episturmian words 
include not only the Sturmian words, but also the well-known Arnoux-Rauzy sequences (e.g., see 
[5l 1241 l38l I46j ) . More precisely, the family of episturmian words is composed of the Arnoux- 
Rauzy sequences, images of the Arnoux-Rauzy sequences by episturmian morphisms, and certain 
periodic infinite words. In the binary case, Arnoux-Rauzy sequences are exactly the Sturmian words 
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whereas episturmian words include all recurrent balanced words, that is, periodic balanced words 
and Sturmian words (see |X8|, [371 [4"4"] for recent results relating episturmian words to the balanced 
property). See also [T7J for a recent survey on episturmian theory. 

Episturmian morphisms play a central role in the study of episturmian words (Section 12.31 recalls 
the definition of these morphisms). Introduced first as a generalization of Sturmian morphisms, 
Justin and Pirillo [23] showed that they are exactly the morphisms that preserve the aperiodic 
episturmian words. They also proved that any episturmian word is the image of another episturmian 
word by some so-called pure episturmian morphism. Even more, any episturmian word can be 
infinitely decomposed over the set of pure episturmian morphisms. This last property allows an 
episturmian word to be defined by one of its morphic decompositions or, equivalently, by a certain 
directive word, which is an infinite sequence of rules for decomposing the given episturmian word by 
morphisms. In consequence, many properties of episturmian words can be deduced from properties 
of episturmian morphisms. This approach is used for instance in [7} I16 [ l27l 1431 1441 146] and of course 
in the papers of Justin et al. 

Morphic decompositions of Sturmian words have been used in [29J to characterize quasiperiodic 
Sturmian words. Quasiperiodicity of finite words was first introduced by Apostolico and Ehren- 
feucht [3] in the following way: "a word w is quasiperiodic if there exists a second word u ^ w 
such that every position of w falls within some occurrence of u in w". The word u is then called a 
quasiperiod of w. In the last fifteen years, quasiperiodicity and covering of finite words has been 
extensively studied (see [21 [21] for some surveys) . In [33] , Marcus extended this notion to infinite 
words and opened some questions, particularly concerning quasiperiodicity of Sturmian words. Af- 
ter a brief answer to some of these questions in [27J, the Sturmian case was fully studied in [29] 
where it was proved that a Sturmian word is non-quasiperiodic if and only if it is an infinite Lyndon 
word. Here we extend this study to episturmian words. 

In Sections [2H3l we recall useful results on episturmian words and their directive words. A 
particularly important tool is the normalization of words directing the same episturmian word, 
which we recently introduced in [281 119] . This idea allows an episturmian word to be defined 
uniquely by its so-called normalized directive word, defined by some factor avoidance. It can be 
seen as a generalization of a previous result by Berthe, Holton, and Zamboni [7J, which was used in 
[29] to show that the directive word of a non-quasiperiodic Sturmian word can take only two possible 
(similar) forms. For non-binary episturmian words, even those defined on a ternary alphabet, this 
simplicity does not hold since a combinatorial explosion of the number of cases occurs. In particular 
there exist ternary non-quasiperiodic episturmian words that have infinitely many directive words. 
As such, the method we use to characterize quasiperiodic episturmian words greatly differs from 
the one in the Sturmian case. 

In Section [H we characterize quasiperiodic episturmian words. To prove it, we introduce a 
new way to tackle quasiperiodicity by stating an equivalent definition that is related to the notion 
of return words. From this, we show that any standard episturmian word (or epistandard word) 
is quasiperiodic; in particular, sufficiently long palindromic prefixes of an epistandard word are 
quasiperiods of it. We then extend this result by describing the quasiperiods of any (quasiperiodic) 
episturmian word (see Theorem 14. 19[) . This yields a characterization of quasiperiodic episturmian 
words in terms of their directive words (Theorem I4.28P . Note that the set of quasiperiods of an 
episturmian word was previously described only for the Fibonacci word [27J. (In [29] this set was 
not described for quasiperiodic Sturmian words) . At the end of Section HI using the normalization 
aspect, we give a second characterization of quasiperiodic episturmian words which itself provides an 
effective way to decide whether or not a given episturmian word is quasiperiodic (see Theorem l4.29p . 

Section [5] is concerned with the study of the action of episturmian morphisms in relation to 
quasiperiodicity. This study leads to non-trivial extensions of results in [29] . Using this approach, 
we provide a completely different proof of our main characterization of quasiperiodic episturmian 
words. We also characterize episturmian morphisms that map any word onto a quasiperiodic one 
(see Section 15. 4p . This result naturally allows us to consider quasiperiodicity of words defined using 
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episturmian morphisms. 

Lastly, in Section [6l we characterize episturmian Lyndon words in terms of their directive words. 
This result shows that, unlike the Sturmian case, there exist non-quasiperiodic episturmian words 
that are not infinite Lyndon words. 

2 Episturmian words and morphisms 

We assume the reader is familiar with combinatorics on words and morphisms (e.g., see [301 131j). 
In this section, we recall some basic definitions and properties relating to episturmian words which 
are needed throughout the paper. For the most part, we follow the notation and terminology of 

pl [231 EH EE]. 

2.1 Notation and terminology 

Let A denote a finite non-empty alphabet. A finite word over A is a finite sequence of letters from 
A. The empty word e is the empty sequence. Under the operation of concatenation, the set .4* of 
all finite words over A is a free monoid with identity element e and set of generators A. The set of 
non-empty words over A is the free semigroup A + = A* \ {e}. 

Given a finite word w = x\X2 ■ ■ ■ x m G A + with each Xi G A, the length of w, denoted by \w\, 
is equal to m. By convention, the empty word is the unique word of length 0. We denote by \w\ a 
the number of occurrences of the letter a in the word w. If \w\ a = 0, then w is said to be a-free. 
The reversal of w, denoted by w, is its mirror image: w = x m x m -i ■ ■ ■ x%, and if w = w, then w is 
called a palindrome. 

A (right) infinite word (or simply sequence) x is a sequence indexed by N + with values in A, 
i.e., x = X1X2X3 ■ ■ ■ with each x^ G A. The set of all infinite words over A is denoted by A w . An 
ultimately periodic infinite word can be written as uv w = uvvv • • • , for some u, v £ A* , v ^ e. If 
u = e, then such a word is (purely) periodic. An infinite word that is not ultimately periodic is 
said to be aperiodic. For easier reading, infinite words are hereafter typically typed in boldface to 
distinguish them from finite words. 

Given a set X of words, X* (resp. X u ) is the set of all finite (resp. infinite) words that can be 
obtained by concatenating words of X. The empty word e belongs to X*. 

A finite word w is a factor of a finite or infinite word z if z = uwv for some words u, v (where 
v is infinite iff z is infinite). In the special case u = e (resp. v = e), we call w a prefix (resp. suffix) 
of z. We use the notation p~ 1 w (resp. ws~ 1 ) to indicate the removal of a prefix p (resp. suffix s) 
of the word w. An infinite word x G A w is called a suffix of z G A w if there exists a word w G A* 
such that z = wx. That is, x is a shift of z, given by x = T'^^z) = w~ 1 z, where T denotes the 
shift map: T((x n ) n >i) = (x n +l)ri>l- Note that a prefix or suffix u of a finite or infinite word w is 
said to be proper if u 7^ w. For finite words w G .4*, the shift map T acts circularly, i.e., if w = xv 
where x G A, then T(w) = vx. 

The alphabet of a finite or infinite word w, denoted by Alph(io) is the set of letters occurring in 
w, and if w is infinite, we denote by Ult(io) the set of all letters occurring infinitely often in w. 

A factor of an infinite word x is recurrent in x if it occurs infinitely often in x, and x itself is 
said to be recurrent if all of its factors are recurrent in it. Furthermore, x is uniformly recurrent 
if for each n there exists a positive integer K(n) such that any factor of x of length at least K{n) 
contains all factors of x of length n. Equivalently, x is uniformly recurrent if any factor of x occurs 
infinitely many times in x with bounded gaps [9]. 

2.2 Episturmian words 

In this paper, our vision of episturmian words will be the characteristic property stated in Theo- 
rem [2J] (below). However, we first give one of their equivalent definitions to aid in understanding. 
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For this, we recall that a factor u of a finite or infinite word w £ A°° is right (resp. left) special if 
ua, ub (resp. au, bu) are factors of w for some letters a, b £ A, a b. 

An infinite word t € A w is episturmian if its set of factors is closed under reversal and t has at 
most one right (or equivalently left) special factor of each length. Moreover, an episturmian word 
is standard if all of its left special factors are prefixes of it. 

In the initiating paper [12], episturmian words were defined as an extension of standard epis- 
turmian words, which were themselves first introduced and studied as a generalization of standard 
Sturmian words using palindromic closure (see Theorem 14.51 later) . Specifically, an infinite word 
was said to be episturmian if it has exactly the same set of factors as some standard episturmian 
word [12j . This definition is equivalent to the aforementioned one by Theorem 5 in [12]. Moreover, 
it was proved in [12J that episturmian words are uniformly recurrent. Hence ultimately periodic 
episturmian words are (purely) periodic. 

Note. Hereafter, we refer to a standard episturmian word as an epistandard word, for simplicity. 

To study episturmian words, Justin and Pirillo [23j introduced episturmian morphisms. In 
particular they proved that these morphisms, which we recall below, are precisely the morphisms 
that preserve the set of aperiodic episturmian words. 

2.3 Episturmian morphisms 

Let us recall that given an alphabet A, a morphism f on A is a map from A* to A* such that 
f(uv) = f(u)f(v) for any words u, v over A. A morphism on A is entirely defined by the images 
of letters in A. All morphisms considered in this paper will be non-erasing: the image of any non- 
empty word is never empty. Hence the action of a morphism / on A* can be naturally extended 
to infinite words; that is, if x = £13:2X3 • • • £ A^, then /(x) = f(xi)f {x?)f (x^) 

In what follows, we will denote the composition of morphisms by juxtaposition as for concate- 
nation of words. 

Episturmian morphisms are the compositions of the permutation morphisms (i.e., the morphisms 
/ such that f(A) = A) and the morphisms L a and R a where, for all a € A: 



Here we will work only on pure episturmian morphisms, i.e., morphisms obtained by composition 
of elements of the sets: 



Note. In [23], the morphism L a (resp. R a ) is denoted by ip a (resp. rji a ). We adopt the current 
notation to emphasize the action of L a (resp. R a ) when applied to a word, which consists of placing 
an occurrence of the letter a on the left (resp. right) of each occurrence of any letter different from a. 

Epistandard morphisms (resp. pure episturmian morphisms, pure epistandard morphisms) are 
the morphisms obtained by concatenation of morphisms in Lj^ and permutations on A (resp. in 
£.4 U 7^,4, in Cji). Note that the episturmian morphisms are exactly the Sturmian morphisms when 
A is a 2-letter alphabet. 

2.4 Morphic decomposition of episturmian words 

Justin and Pirillo [23] proved the following insightful characterizations of epistandard and epis- 
turmian words (see Theorem 12.11 below), which show that any episturmian word can be infinitely 
decomposed over the set of pure episturmian morphisms. 

The statement of Theorem 12.11 needs some extra definitions and notation. First we define the 
following new alphabet, A = {x \ x € A}. A letter x is considered to be x with spin R, whilst 




for all b 7^ a in A. 



C A = {L, 



a 



a € .4} and TZj^ = {Ra | a £ A}. 
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x itself has spin L. A finite or infinite word over A U A is called a spinned word. To ease the 
reading, we sometimes call a letter with spin L (resp. spin R) an L-spinned (resp. i?-spinned) 
letter. By extension, an L-spinned (resp. i?-spinned) word is a word having only letters with spin 
L (resp. spin R). 

The opposite w of a finite or infinite spinned word w is obtained from w by exchanging all spins 
in w. For instance, if v = aba, then v = aba. When v S A + , then its opposite v £ A + is an 
L!-spinned word and we set e = e. Note that, given a finite or infinite word w = W1W2 ■ ■ ■ over 
A, we sometimes denote w = w\W2 • ■ ■ any spinned word such that Wi = Wi if w>i has spin L and 
Wi = uii if Wi has spin R. Such a word w is called a spinned version of 

Note. In Justin and Pirillo's original papers, spins are and 1 instead of L and R. It is convenient 
here to change this vision of the spins because of the relationship with episturmian morphisms, 
which we now recall. 

For a £ A, let fj, a = L a and fi a = Ra- This operator /i can be naturally extended (as done 
in [23]) to a morphism mapping any word over (A U A) into a pure episturmian morphism: for a 
spinned finite word w = w\ ■ ■ ■ w n over AU A, fJ,w = /a^ ■ ■ ■ [i^ n (/i e is the identity morphism) . We 
will say that the word w directs or is a directive word of the morphism \i w . The following result 
extends the notion of directive words to infinite episturmian words. 

Theorem 2.1. [23] 

i) An infinite word s £ A w is epistandard if and only if there exists an infinite word A = 
X1X2X3 ■ ■ ■ over A and an infinite sequence (s^) n >Q of infinite words such that = s and 
for all n>l, s^" 1 ) = L Xn (s^). 

ii) An infinite word t G A u is episturmian if and only if there exists a spinned infinite word 
A = X1X2X3 • • • over and an infinite sequence (t( n )) n >o of recurrent infinite words such 

that t(°) = t and for all n>l, t( n-1 > = fJ-xJt (n) )- 

For any epistandard word (resp. episturmian word) t and L-spinned (resp. spinned) infinite word 
A (resp. A) satisfying the conditions of the above theorem, we say that A (resp. A) is a (spinned) 
directive word for t or that t is directed by A (resp. A). 

Remark 2.2. It follows immediately from Theorem 1 2 . 1 1 that if t is an episturmian word directed 
by a spinned infinite word A, then each t( n ) (as defined in part ii)) is an episturmian word directed 
by T n (A) = x n+1 x n+2 x n+3 

By Theorem 1 in [12] (see also Theorem 14. 51 later) . any epistandard word has a unique L-spinned 
directive word, but also has infinitely many other directive words (see [231 ESI H]). For example, 
the Tribonacci word (or Rauzy word [39] ) is directed by {abc) u and also by (abc^abcfabc) 1 ^ for each 
n > 0, as well as infinitely many other spinned words. More generally, by Proposition 3.11 in [23J, 
any spinned infinite word A having infinitely many L-spinned letters directs a unique episturmian 
word t beginning with the left-most L-spinned letter in A. Moreover, by one of the main results in 
[19] (see Theorem 13.21 later), t has infinitely many other directive words. 

The following important fact links the two parts of Theorem 12.11 

Fact 2.3. [23j If t is an episturmian word directed by a spinned version A of an L-spinned infinite 
word A, then t has exactly the same set of factors as the (unique) epistandard word s directed by 
A. 

Moreover, with the same notation as in the above remark, the episturmian word t is periodic if 
and only if the epistandard word s is periodic, and this holds if and only if [Ult(A)| = 1 (see [23l 
Prop. 2.9]). More precisely, a periodic episturmian word takes the form (^(x)) w for some finite 
spinned word w and letter x. 
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Note. Sturmian words are precisely the aperiodic episturmian words on a 2-letter alphabet. 

When an episturmian word is aperiodic, we have the following fundamental link between the 
words (t( ra )) n >o and the spinned infinite word A occurring in Theorem 12.11 if a n is the first let- 
ter of t( n \ then fJ,z r ..£ n (a n ) is a prefix of t and the sequence {^x 1 -x n { a n))n>i is not ultimately 
constant (since A is not ultimately constant), then t = lim n _»oo x«( a n)- This fact is a slight 
generalization of a result of Risley and Zamboni Prop. III. 7] on S-adic representations for 
characteristic Arnoux-Rauzy sequences. See also the recent paper [7] for S-adic representations of 
Sturmian words. Note that S-adic dynamical systems were introduced by Ferenczi [T5] as minimal 
dynamical systems (e.g., see [38J) generated by a finite number of substitutions. In the case of 
episturmian words, the notion itself is actually a reformulation of the well-known Rauzy rules, as 
studied in [40]. In fact, it is well-known that the subshift of an aperiodic episturmian word t (i.e., 
the topological closure of the shift orbit of t) is a minimal dynamical system, i.e., it consists of all 
the episturmian words with the same set of factors as t. 

3 Useful results on directive words 

Notions concerning directive words of episturmian words and morphisms have been recalled in 
the previous section. Two natural questions concerning these words are: When do two distinct 
finite spinned words direct the same episturmian morphism? When do two distinct spinned infinite 
words direct the same unique episturmian word? In this section we recall existing answers to these 
questions. We also present a way to uniquely define any episturmian word through a normalization 
of its directive words. This powerful tool was recently introduced in our papers [T^t |2"B]. 

3.1 Presentation versus block-equivalence 

Generalizing a study of the monoid of Sturmian morphisms by Seebold [47], the third author [41] 
answered the question: 'When do two distinct finite spinned words direct the same episturmian 
morphism?" by giving a presentation of the monoid of episturmian morphisms. This result was 
reformulated in [32] using another set of generators and it was independently and differently treated 
in [25]. As a direct consequence, one can see that the monoid of pure epistandard morphisms is 
a free monoid and one can obtain the following presentation of the monoid of pure episturmian 
morphisms: 

Theorem 3.1. (direct consequence of \42\ Prop. 6.5]; reformulation of |25[ Th. 2.2]) 

The monoid of pure episturmian morphisms with {L a ,R a \ a £ A} as a set of generators has 
the following presentation: 

Ra±^a2 ' ' ' -^afc-^ai — L ai L a2 ■ ■ ■ L ak R ai 

where k > 1 is an integer and a±, . . . , £ A with a± ^ a% for all i, 2 < i < k. 

This result means that two different compositions of morphisms in Cjs, U TZjs, yield the same 
pure episturmian morphism if and only if one composition can be deduced from the other in a 
rewriting system, called the block- equivalence in [25]. Although Theorem 13.11 allows us to show that 
many properties of episturmian words are linked to properties of episturmian morphisms, it will be 
convenient for us to have in mind the block-equivalence that we now recall. 

A word of the form xvx, where x € A and v £ (A \ {%})*, is called a (x-based) block. A (x- 
based) block-transformation is the replacement in a spinned word of an occurrence of xvx (where 
xvx is a block) by xvx or vice-versa. Two finite spinned words w, w' are said to be block-equivalent 
if we can pass from one to the other by a (possibly empty) chain of block-transformations, in 
which case we write w = w'. For example, babcbac and babcbac are block-equivalent because 
babcbac — ► babcbac — > babcbac and vice- versa. The block-equivalence is an equivalence relation over 
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spinned words, and moreover one can observe that if w = w 1 then w and w' are spinned versions of 
the same word over A. 

Theorem 13.11 can be reformulated in terms of block-equivalence: 

Theorem 13.11 Let w, w' be two spinned words over AU A. Then fi w = fi w / if and only if w = w 1 . 

3.2 Words directing the same episturmian word 

Using the block-equivalence notion, the question: "When do two distinct spinned infinite words 
direct the same unique episturmian word?" was almost completely solved by Justin and Pirillo in 
|25j for bi-infinite episturmian words, i.e., episturmian words with letters indexed by Z (and not 
by N as we consider here). More recently, in [19], we showed that Justin and Pirillo's results on 
directive words of bi-infinite episturmian words are still valid for words directing (right-infinite) 
episturmian words. We also established the following complete characterization of pairs of spinned 
infinite words directing the same unique episturmian word. Not only does our characterization 
provide the relative forms of two spinned infinite words directing the same episturmian word, but 
it also fully solves the periodic case, which was only partially solved in |25j . 

Theorem 3.2. |19j Given two spinned infinite words Ai and A2, the following assertions are 
equivalent. 

i) Ai and A2 direct the same right-infinite episturmian word; 

ii) Ai and A2 direct the same bi-infinite episturmian word; 

Hi) One of the following cases holds for some i,j such that {i,j} = {1, 2}; 

1. Ai = n n >i v n, Aj = Iln>l z n w ^ere (v n )n>i, (z n )n>i are spinned words such that fj, Vn = 
fi Zn for all n > 1; 

2. Ai = wxY\ n>l v n x n , Aj = w'x Iln>i v n Xn where w, w' are spinned words such that 
fJ-w = fJ"u}'> x i s an L-spinned letter, (i> ra )n>i is a sequence of non-empty x-free L-spinned 
words, and (x n )n>i, {%n)n>l are sequences of non-empty spinned words over {x,x} such 
that, for all n > 1, \x n \ = \x n \ and \x n \ x = \x n \ x ; 

3. Ai = lox and A2 = io'y where w, w' are spinned words and x € {x,x} u , y € {y,y} w 
are spinned infinite words for some letters x, y such that [A w (x) = (i w >(y). 

In items Q] and [2] of Theorem 13.21 the two considered directive words are spinned versions of 
the same L-spinned word. This does not hold in item[3l which concerns only periodic episturmian 
words. In particular, we make the following observation: 

Fact 3.3. If an aperiodic episturmian word is directed by two spinned words Ai and A2, then Ai 
and A2 are spinned versions of the same L-spinned word A. 

As an example of item [3j one can consider the periodic episturmian word {bcba) u which is 
directed by both bca u and bac^ . Note also that (bcba)^ is epistandard and has the same set of 
factors as the epistandard word (babe) 1 " 1 directed by bac^ ' . Actually, in view of Fact I2.3| we observe 
the following: 

Fact 3.4. The subshift of any aperiodic episturmian word contains a unique (aperiodic) epistan- 
dard word, whereas the subshift of a periodic episturmian word contains exactly two (periodic) 
epistandard words, except if this word is 0? with a a letter. 
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3.3 Normalized directive word of an episturmian word 

Items [2] and [3] of Theorem 13.21 show that any episturmian word is directed by a spinned word having 
infinitely many L-spinned letters, but also by a spinned infinite word having both infinitely many 
L-spinned letters and infinitely many i?-spinned letters. To emphasize the importance of these 
facts, let us recall from Proposition 3.11 in [23J that if A is a spinned infinite word over ^lU«4 
with infinitely many L-spinned letters, then there exists a unique episturmian word t directed by 
A. Unicity comes from the fact that the first letter of t is fixed by the first L-spinned letter in A. 

To work on Sturmian words, Berthe, Holton and Zamboni [7J proved that any Sturmian word has 
a unique directive word over {a, b, a, 6} containing infinitely many L-spinned letters but no factor 
of the form ab n a or ba n b with n an integer. Using Theorems 13.11 and 13.21 we recently generalized 
this result to episturmian words: 

Theorem 3.5. |19|. [28] Any episturmian word t has a spinned directive word containing infinitely 
many L-spinned letters, but no factor in \^j a€ ^aA*a. Such a directive word is unique ift is aperiodic. 

Note that unicity does not necessarily hold for periodic episturmian words. For example, the 
periodic episturmian word (ab) u = L a {ff) = R^a 1 ^) is directed both by ab u and by ba u (L a (b) = 
ab = R (a)). 

A directive word of an aperiodic episturmian word t with the above property is called the 
normalized directive word of t. We extend this definition to morphisms: a finite spinned word w is 
said to be a normalized directive word of the morphism /j, w if w has no factor in [J a£ ^aA*a. 

One can observe from Theorem 13.11 that for any morphism in L a C^R a , we can find another 
decomposition of the morphism in the set R a TZ^L a . Equivalently, for any spinned word in aA*a, 
there exists a word w' in a A* a such that fi w = fj, w i. This was the main idea used in the proof of 
Theorem 13.51 

Example 3.6. Let / be the pure episturmian morphism with directive word abcbabacbaca. By The- 
orem EH ll- azi - a - ca = Vacbaca = Vacbaca and hence / = abcbabacbaca = V abcbabacbaca and abcbabacbaca is 

the normalized directive word of /. 

3.4 Episturmian words having a unique directive word 

Using our characterization of pairs of words directing the same episturmian word (Theorem 13. 2p 
together with normalization (Theorem 13. 5ft . we recently characterized episturmian words having a 
unique directive word. 

Theorem 3.7. [19] An episturmian word has a unique directive word if and only if its (normalized) 
directive word contains 1) infinitely many L-spinned letters, 2) infinitely many R-spinned letters, 
3) no factor in \J a £A®'-A* a > 4) no factor in \J ae ^aA*a. Such an episturmian word is necessarily 
aperiodic. 

As an example, a particular family of episturmian words having unique directive words consists 
of those directed by regular wavy words, i.e., spinned infinite words having both infinitely many 
L-spinned letters and infinitely many i?-spinned letters such that each letter occurs with the same 
spin everywhere in the directive word. More formally, a spinned version w of a finite or infinite 
word uu is said to be regular if, for each letter x € Alph(w), all occurrences of x in w have the same 
spin (L or R). For example, abaacb and {abc)^ are regular, whereas abaacb and (aba)^ are not 
regular. In [25], a spinned infinite word is said to be wavy if it contains infinitely many L-spinned 
letters and infinitely many ii-spinned letters. For example, the two preceding infinite words are 
wavy. 

In the Sturmian case, we have: 



8 



Proposition 3.8. |19j Any Sturmian word has a unique spinned directive word or infinitely many 
spinned directive words. Moreover, a Sturmian word has a unique directive word if and only if its 
(normalized) directive word is regular wavy. 

In the next section, we shall see that any episturmian word having a unique directive word 
is necessarily non-quasiperiodic. This will follow from Theorem 13.71 and our characterizations of 
quasiperiodic episturmian words (Theorems 14.191 T4.281 and I4.29| ). 

4 Quasiperiodicity of episturmian words 

4.1 Quasiperiodicity 

Recall (from [U l2"ij 155] ) that a finite or infinite word w is quasiperiodic if it can be constructed by 
concatenations and superpositions of one of its proper factors u, which is called a quasiperiod of w 
(or the smallest quasiperiod of w when it is of minimal length) . We also say that u covers w or that w 
is u-quasiperiodic. For example, the word w = abaababaabaababaaba has aba, abaaba, abaababaaba 
as quasiperiods, and the smallest quasiperiod of w is aba. Words that are not quasiperiodic are 
naturally called non-quasiperiodic words. 

When defining infinite quasiperiodic words, for convenience, we consider the words preceding 
the occurrences of a quasiperiod: an infinite word w is quasiperiodic if and only if there exist a 
finite word u and words (p n )n>o such that po = e, \p n \ < |p n +l| < \Pn u \i an d p n u is a prefix of w 
for all n > 0. Then u is a quasiperiod of w and we say that the sequence (pnu)n>o is a covering 
sequence of prefixes of the word w. Necessarily, any quasiperiod of a quasiperiodic word must be a 
prefix of it. 

Readers will find several examples of infinite quasiperiodic words in |27l [32 , 33J . Let us mention 
for instance that the Fibonacci word, directed by (ab) u , is a&a-quasiperiodic (see |27J). 
Let us now recall a simple, yet important, fact about quasiperiodic words. 

Fact 4.1. If to is a (finite or infinite) u-quasiperiodic word and / is a non-erasing morphism, then 
f(w) is /(li)-quasiperiodic. 

Note that the converse of this fact is not true. For example, let / = R a and w = ab u , then 
R a (w) = a(bay. The word R a (w) is covered by R a {ab), but w is not covered by ab. 

4.2 Return words and quasiperiodicity 

We now use the notion of a 'return word' to give an equivalent definition of quasiperiodicity (see 
Lemma l4.3p . which proves to be a useful tool for studying quasiperiodicity in episturmian words. 

Return words were introduced independently by Durand 13J and by Holton and Zamboni [20] 
when studying primitive substitutive sequences. Such words can be defined in the following way. 

Definition 4.2. Let v be a recurrent factor of an infinite word w = W1W2W3 ■ ■ ■ , starting at 
positions n\ < r&2 < ns ■ ■ ■ in w. Then each word r, = w ni w ni+ i ■ ■ ■ w ni+1 -\ is called a return to v 
in w. 

That is, a return to v in w is a non-empty factor of w beginning at an occurrence of v and 
ending exactly before the next occurrence of v in w. Thus, if r is a return to v in w, then rv is a 
factor of w that contains exactly two occurrences of v, one as a prefix and one as a suffix. As any 
episturmian word t is uniformly recurrent [12] , each factor of t has only a finite number of different 
returns (for more details see Theorem 14. 7p . 

Note. A return to v in w always has v as a prefix or is a prefix of v. In particular, we observe that 
a return to v is not necessarily longer than v, in which case v has overlapping occurrences in w 
(i.e., vz~ 1 v is a factor of w for some non-empty word z). We say that v has adjacent occurrences in 
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w if vv is a factor of w. In this case, if v is primitive (i.e., not an integer power of a shorter word), 
then v is a return to itself; otherwise, the corresponding return to v is the primitive root of v. 

In terms of return words, we have the following equivalent definition of a quasiperiodic infinite 
word. 

Lemma 4.3. A finite word v is a quasiperiod of an infinite word w if and only if v is a recurrent 
prefix of w such that any return to v in w has length at most \v\. 

Proof. If v is a quasiperiod of w, then v is a prefix of w and its occurrences entirely cover w. That 
is, v is recurrent in w and successive occurrences of v in w are either adjacent or overlap, and 
hence any return to v has length at most \v\. Conversely, if v is a recurrent prefix of w such that 
any return to v has length at most \v\, then successive occurrences of v in w are either adjacent or 
overlap, and hence entirely cover w. Thus w is f-quasiperiodic. □ 

Immediately: 

Corollary 4.4. An infinite word w is quasiperiodic if and only if there exists a recurrent prefix v 
of w such that any return to v in w has length at most \v\, in which case v is a quasiperiod ofw. 
Moreover, the shortest such prefix v is the smallest quasiperiod ofw. 

A noteworthy fact is that a quasiperiodic infinite word is not necessarily recurrent [33J , although 
it must have a prefix that is recurrent in it. 

4.3 Return words and palindromic closure in episturmian words 

In this section we recall a characterization of return words in episturmian words, given by Justin 
and Vuillon in [26]. For this, we first recall the construction of epistandard words using palin- 
dromic right- closure as well as some related properties from |12[ [23] that will be used throughout 
Sections El to E31 

The palindromic right-closure u>W of a finite word w is the (unique) shortest palindrome having 
w as a prefix (see [II]). That is, = wv~ l w where v is the longest palindromic suffix of w. 

The iterated palindromic closure function [22], denoted by Pal, is defined recursively as follows. 
Set Pal(e) = e and, for any word w and letter x, define Pal(wx) = (Pal(w)x)( + \ For instance, 
Pal(abc) = (Pal(ab)c)^ = (abacp + ' = abacaba. 

Generalizing a construction given in [H] for standard Sturmian words, Droubay, Justin and 
Pirillo established the following characterization of epistandard words. 

Theorem 4.5. [32] An infinite word s G A u is epistandard if and only if there exists an infinite 
word A = X1X2X3 ■ ■ ■ (xi € A) such that s = lim.,^00 Pal(x\ ■ ■ ■ x n ). 

Note that the palindromes Pal{x\ ■ ■ ■ x n ) are very often denoted by ii n +i in the literature. 

In [23] , Justin and Pirillo showed that the word A is exactly the directive word of s as it occurs 
in Theorem 12. 1[ Moreover, by construction, A uniquely determines the epistandard word s. Notice 
also that by construction, the words {Pal{x\ ■ ■ ■ Xj))i>o are exactly the palindromic prefixes of s. 

There exist many relations between palindromes and episturmian morphisms. The following 
ones will be useful in the next few sections. First recall from [221 [23J that we have 

Pal(wv) = fi w (Pal(v))Pal(w) for any words w, v. (4-1) 

In particular, for any x a letter, Pal(xv) = L x {Pal{v))x and Pal{wx) = fi w (x)Pal(w). 
For letters (^j)i<j<i, formula (|4.ip inductively leads to: 

Pal(xi ■ ■ -Xi) = Mxi-^-iC^i) ■ ■ ■ fJ'x 1 (x 2 )x 1 = Yl Vxi-xj-tixj)- (4.2) 

l<j<i 
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Note that by convention, x\ ■ ■ ■ xq = e in the above product. 

Now let w = x\X2 ■ ■ ■ x n be a spinned version of w = x\X2 ■ ■ ■ x n (viewed as a prefix of a spinned 
version A of A). Then, for any finite word v, we have 

Hw{v) = S^fiw^Sa where = ]J fJ, Xl -xi-i (4.3) 

i=n,...,l 

The word is called the shifting factor of [25]. Observe that is a prefix of Pal(w); in 
particular 5^ = Pal(w) by equation (|4.2p . Note also that fia{v) = T^ s ^(/j w (v)). 

For example, for w = abca, we have S$ = fJ> a ,bc(, a )f J >a{b) = abacabaab. Thus since fJ- a bca(ca) = 
abacabaab.acabacaba, /j, ab - ca (ca) = T 9 (n a bca(ca)) = acabacaba. abacabaab. 

Likewise, for any infinite word y £ .A"', 

M*(y) = s^Wy)- (4.4) 

This formula used with w = w shows that: 
Fact 4.6. Any word of the form fj, w (y) with y infinite begins with Pal(w). 

Amongst the numerous interests of the palindromes {Pal(x\ ■ ■ -x n )) n >o, we have the following 
explicit characterization of the returns to any factor of an epistandard word. 

Theorem 4.7. [26J Suppose s is an epistandard word directed by A = x\X2X% ■ ■ ■ with Xi G A, 
and consider any factor v of s. If the word it n +i = Pal{x\ ■ ■ ■ x n ) is the shortest palindromic 
prefix of s containing v with u n+ \ = fvg, then the returns to v are given by J ' fj, Xl ... Xn (x) if where 
x G Alph(x n+ ix n+2 ■ ■ •). 

Because of the uniform recurrence of episturmian words, the following simple but important fact 
about return words holds. 

Lemma 4.8. Suppose s is an epistandard word and let t be any episturmian word in the subshift 
of s. Then, for any factor v of s, r is a return to v in s if and only if r is a return to v in t. 

That is, the returns to any factor v of an epistandard word s are the same as the returns to v 
as a factor of any episturmian word t with the same set of factors as s. Hereafter, we often use the 
above result without reference to it. 

The following result is particularly useful in Sections I4.3H4.61 

Proposition 4.9. Suppose s is an epistandard word directed by A = X1X2X3 ■ ■ ■ with X{ E A, 
and let t be an episturmian word directed by a spinned version of A. Then t begins with 
Pal{x\ ■ ■ ■ x n ) for some non-negative integer n if and only if t has a directive word of the form 
A — x\ • • • x n x n +ix n _|_2 ■ • • where the prefix L -spinned. 

Proof When t is directed by a spinned version of A of the form A = X1X2 ■ ■ ■ x n x n +ix n +2 ■ ■ ■ where 
the prefix X1X2 ■ • -x n is L-spinned, t begins with Pal{x\ ■ ■ ■ x n ) by Fact 14.61 

Conversely, suppose Pal(x\ ■ ■ prefix of t, and suppose that t is directed by A = (ast)i>l 

(a spinned version of A = (xi)i>x). From Theorem 13.21 we can suppose that A contains infinitely 
many L-spinned letters. If n = 0, there is nothing to prove. Else let x\ = a. Assume first that 
x\ = a. Let k be the smallest positive integer such that Xk € A. Since t begins with the letter a 
(which is the first letter of Pal(x\ ■ ■ ■ x n )), we have x^ = a. Then A = ax2 ■ ■ ■ x^-iax^i • • • , and 
hence by Theorem 13. li t is also directed by the spinned infinite word beginning with 0x2 ■ ■ ■ Xj-\a, 
with j < k. Therefore we may assume from now on that x\ = a. Let t' be the episturmian word 
directed by T(A) = (a?i)i>2- It is easily seen from the equality Pal{x\ ■ ■ ■ x n ) = L a (Pal(x2 ■ • • x n ))a 
that Pal(x2 ■ ■ prefix of t'. Hence by induction t' is directed by a spinned version of 

T(A) with X2 ' ' ' %n ^S Si prefix. And so t is directed by a spinned version of A of the form 
X1X2 • ■ ■ x n x n x n+ i ■ ■ ■ where the prefix X1X2 ■ ■ ■ x n is L-spinned. □ 
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4.4 All epistandard words are quasiperiodic 

A first consequence of Theorem 14.71 is that any epistandard word is quasiperiodic. More precisely: 

Theorem 4.10. Suppose s is an epistandard word with directive word A = X1X2X3 ■ ■ ■ with Xi £ A, 
and let m be the smallest positive integer such that Alph(xiX2 • • • x m ) = Alph(s). Then, for all 
n>m, Pal(x\ ■ ■ ■ x n ) is a quasiperiod of s. 

Proof of Theorem \4-10[ We suppose that s is an epistandard word with directive word A = 
X1X2X3 ■■■ , Xi E A. Let m be the smallest positive integer such that Alph(xiX2 • • • x m ) = Alph(s). 
Clearly, for n < m, Pal{x\ ■ ■ ■ x n ) cannot be a quasiperiod of s since it does not contain all of the 
letters in Alph(s). 

Now let n > m. We know that Pal{x\ ■ ■ ■ x n ) is a prefix of s. Suppose that for some k > n, 
Pal{x\ ■ ■ ■ Xk) is covered by Pal{x\ ■ ■ ■ x n ). Since by choice of m, Xk+i belongs to {xi, • • • , Xk}, 
we have \(Pal(x\ ■ ■ ■ £fc) x fc+i)^ I — 2\Pal(x\ ■ ■ ■ x^)\. Hence since Pal(x± ■ ■ ■ Xk) is a palindrome, 
Pal{x\ ■ ■ ■ Xk+i) is covered by Pal(x\ ■ ■ ■ x^), and so by Pal{x\ ■ ■ ■ x n ). The result follows from 
Theorem 14.51 by induction. 

□ 

Example 4.11. Recall the Tribonacci word: 

r = abacabaabacababacabaabacabacabaabaca ■ ■ ■ , 

which is the epistandard word directed by (abc)^ . Observe that Pal(abc) = abacaba is the shortest 
palindromic prefix of r such that Alph(PaZ (abc)) = Alph(r) = {a,b,c}. By Theorem 14. 7\ the 
returns to Pal(abc) in r are: ^ a bc{o) = abacaba, ^ a b c {b) = abacab, n a bc(c) = abac, none of which are 
longer than Pal{abc). Hence r is a&aca&a-quasiperiodic; in fact Pal{abc) = n a bc{ a ) = abacaba is the 
smallest quasiperiod of r since its prefixes abac, abaca, abacab have returns longer than themselves. 
This latter fact is also evident from our description of quasiperiods of a (quasiperiodic) episturmian 
word (Theorem 14. 191 to follow). 

More generally, the k-bonacci word, which is directed by (a±a2 ■ - - ak) u , is quasiperiodic with 
smallest quasiperiod Pal(x± ■ ■ ■ Xk)- This fact was also observed in [27] by noting that the k- 
bonacci word is generated by the morphism tp^ on {a\,a2, ■ ■ ■ ,afc} defined by (fk(ai) = aidj+i for 
all i 7^ k, and (fk(ak) = «i- 

Remark 4.12. From Fact I4.ll and Theorem 14.101 we immediately deduce that (p(s) is a quasiperi- 
odic infinite word for any epistandard word s. Moreover, if <p is a pure episturmian morphism, 
then 93 (s) is a quasiperiodic episturmian word. More precisely, Hw(s) is a quasiperiodic episturmian 
word for any epistandard word s and spinned word iv. Such an episturmian word is directed by 
a spinned infinite word of the form wA where A is the L-spinned directive word of s. Hence, if 
an episturmian word t is directed by a spinned infinite word with all spins ultimately L, then t is 
quasiperiodic. 

More generally, we have the following consequence of Theorem 14.101 (a converse of this result is 
stated in Theorem 14.191 an d a generalization is provided by Theorem 15.11 later). 

Corollary 4.13. // an episturmian word t is directed by A = ivvy for some spinned words w, y 

and L-spinned word v such that Alph(v) = Alph(?;y), then t is quasiperiodic. 

Moreover, any word of the form fi^(Pal(v))p with p a prefix of S^ 1 Pal(w) is a quasiperiod oft. 

Proof. Let (t( n )) ra >o be the infinite sequence of episturmian words associated to t and A in Theo- 
rem ETJ Then by Proposition 14.91 the episturmian word t^ w \\ which is directed by vy, begins with 
the palindromic prefix Pal(y) of the epistandard word s" w \' directed by the L-spinned version of 
vy. Moreover, since Alph(-u) = Alph(uy), Pal(v) is a quasiperiod of s^ w " by Theorem 14.101 By 
Fact 12.31 s^ w " is in the subshift of t^D. It follows from Lemma 14.81 that Pal(v) has the same 
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returns in t" w " as it does in s^'\ and since it is a prefix of t^ w ^ , it is also a quasiperiod of t^' w ^. 
Therefore by Fact 14, li t = Ha(t^ w ^) is quasiperiodic and fj,^(Pal(v)) is a quasiperiod of t. 

Now by formula (|4.4p and by Fact 14.61 each occurrence of n^{Pal{v)) is followed by S^Pa^w) 
in t, and so is followed by p for any prefix p of S'r 1 Pal(w). This shows that ^{Pal{v))p is a 
quasiperiod of t. □ 



4.5 Ultimate quasiperiods 

Theorem l4.10l shows that long enough palindromic prefixes of an epistandard word s are quasiperiods 
of s. Our next goal is to extend Theorem 14. 1UI by describing all the quasiperiods of any (quasiperi- 
odic) episturmian word. Several lemmas are required, using the notion of ultimate quasiperiods 
that we now define. 

A factor v of an infinite word w is said to be an ultimate quasiperiod of w if it is a quasiperiod of 
a suffix of w. In particular, when w is a uniformly recurrent word (which is the case for episturmian 
words) , a recurrent factor v in w is an ultimate quasiperiod of w if any return to v in w has length 
at most \v\. 

Clearly, if an ultimate quasiperiod is a prefix of w, then w is quasiperiodic by Lemma [4.31 Also 
note that the set of quasiperiods of a (quasiperiodic) infinite word w consists of all its ultimate 
quasiperiods that are prefixes of it. 

Remark that we have 
Proposition 4.14. All episturmian words are ultimately quasiperiodic. 

Proof. From Theorem 14. 101 all epistandard words are quasiperiodic. Let s be an epistandard word 
and let q be a quasiperiod of s. Then any episturmian word t in the subshift of s has a suffix t' 
beginning with q. Since t has the same set of factors than s, we can find infinitely many prefixes 
oft' which are covered by q, i.e., t' is quasiperiodic. Hence t is ultimately quasiperiodic. □ 

Now we recall some insight about palindromic closure that will be useful later. As previously, 
let s denote an epistandard word with directive word A = x\X2X% ■ ■ ■ with Xi € A. For n > let 
u n+ i = Pal{x\ ■ ■ ■ x n ). Note in particular that u\ = e and by Theorem 14.51 s = lim n _»oo u n . 

As in [261 [23], let us define P(i) = sup{j < i \ Xj = x{\ if this number exists, undefined 
otherwise. That is, if x\ = a, then P(i) is the position of the right-most occurrence of the letter a 
in the prefix x\X2 ■ ■ ■ of the directive word A. For instance, if A = (a6c) w , P(i) = i — 3 for any 
i > 4 and P(l), P(2), and P(3) are undefined. 

From the definitions of palindromic closure and the palindromes (uj)j>i, it follows that, for all 
i > 1, 

\UiXiUi if Xi £ Alph(iij), 
Ui+i = < _! . (4.5) 

yuiU p ^Ui otherwise. 

Therefore, using Theorem 14.71 with / = g = e, we deduce that for n > 0, the length of the 
longest return r n+ \ to u n+ \ in s satisfies 



\fn+l\ 



\u n+ \ \ + 1 if some x G Alph(s) does not occur in u n+ \, 
I u„ J otherwise, 



where p n = inf{P(i) | i > n + 1} (see also \26\ Lem. 5.6]). 

In other words, p n = sup{i < n \ Alph(xj • • • x n ) = Alph(xj • • • x n ■ ■ ■ )}. For instance, if A = 
{abc) w , then p^ = 1. 

The next lemma gives the set of all ultimate quasiperiods of any episturmian word t. It simply 
amounts to determining all of the factors of t that have no returns longer than themselves. 
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Notation. Hereafter, we denote by F(w) the set of factors of a finite or infinite word w. 

Lemma 4.15. Suppose s is an epistandard word directed by A = x\x 2 x^ ■ ■ ■ , Xj G A. Let m be the 
smallest positive integer such that Alph(xiX2 • • • x m ) = Alph(s) and let ii n +i = Pal(x% ■ ■ ■ x n ) for 
all n > 0. Then the set of all ultimate quasiperiods of any episturmian word t in the subshift of s 
is given by 

Q = [J Qn with Q n = {q G F{u n+1 ) \ \q\ > \u n+1 \ - \u Pn \}, 

n>m 

where p n = sup{i < n | Alph(xj • • • x n ) = Alph(xj ■■■ x n ■■■)}. 

Proof. First observe that the number p n exists for all n > m. Indeed, the set {i < n \ 
Alph(xj • • • x n ) = Alph(xj ■ • • x n ■ ■ ■ )} is not empty, as by the definition of m it contains i = 1. 

Clearly, if n < m, then no factor of ii n +i can be an ultimate quasiperiod of t since u n +i does 
not contain all of the letters in Alph(t) = Alph(A). So let us now fix n > m. For any q G Q n , 
\q\ > |^n+i| — \ u p„\ > \ u n\- Indeed from formula (14. 5j) . if n = m then u n+ \ = u n x n u n , and if 
n > m + 1, then u n+ \ = UnUp^Un where P(n) > p n . Hence q G F{u n+ i) \ F(u n ), i.e., u n+ \ is the 
shortest palindromic prefix of s containing q. Therefore, by Theorem 14.71 the returns to q G Q n 
are a certain circular shift of the returns to n n +i and the longest of these return words has length 
|-u n+ i| — \u Pn \. Thus any return to q has length at most \q\; whence q is an ultimate quasiperiod 
of t. It remains to show that any other factor w G F(u n+ i) \ F(u n ) with \w\ < [ti n +i| — \ u p n \ * s 
not an ultimate quasiperiod. This is clearly true since the longest return to any such w has length 
l^n+il — \ u p n \ > \ w \- That is, at least one of the returns to w is longer than it, which implies that 
w is not an ultimate quasiperiod of t. □ 

Example 4.16. Let us consider the Fibonacci case. As observed in |10| . \u n \ = F n+ \ — 2 for 
all n > 1 where F^ is the k-th Fibonacci number (iq = 1, F2 = 2, F^ = F^-i + -Ffc-2 for 
k > 2). Since for n > 2, p n = n — 1 and |it n +i| — \up n \ = F n+ i, the ultimate quasiperiods of the 
Fibonacci word are the factors of of length between F^ and -Ffc+i — 2 for all k > 3. The first few 
ultimate quasiperiods of the Fibonacci word (in order of increasing length) are: aba (u^), abaab, 
baaba, abaaba (U4), abaababa, baababaa, aababaab, ababaaba, abaababaa, baababaab, aababaaba, 
abaababaab, baababaaba, abaababaaba (u^), ... 

Lemma 14.151 yields the following trivial characterization of quasiperiodic episturmian words. 

Corollary 4.17. Suppose s is an epistandard word with set of ultimate quasiperiods Q. Then an 
episturmian word t in the subshift of s is quasiperiodic if and only if some v G Q is a prefix of t . 

Moreover, Lemma l4,15l can be reformulated (more nicely) using episturmian morphisms, together 
with the iterated palindromic closure function. 

Lemma 4.18. Suppose s is an epistandard word directed by A G A w . Then the set of ultimate 
quasiperiods of any episturmian word t in the subshift of s is the set of all words 

q G F(Pal(wv)), with \q\ > |^(Pa/(v))|, 

where w, v are words such that A = wvy with Alph(u) = Alph(uy). 

Proof. Let A = X1X2X3 ■ ■ ■ and let m be the smallest positive integer such that Alph(xi2;2 • • • x m ) = 
Alph(s). Then, by Lemma 14.151 the set of all ultimate quasiperiods of s (and hence of t) is given 
by Q = U„> m Qn with 

Q n = {q G F(u n+1 ) I \q\ > \u n+1 \ - \u Pn \}, 

where p n = sup{« < n \ Alph(xj • • • x n ) = Alph(xj • • • x n ■ ■ ■ )}. So, for fixed n > m, A can be written 
as A = wvy where w = x\-- x Pn -\, v = x Pn ■ ■ ■ x n , y = x n+ ix n+2 ■■■ , and Alph(v) = Alph(?;y) (by 
the definition of p n ). Then u n+ \ = Pal{x\ ■ ■ ■ x n ) = Pal(wv) and u Pn = Pal(x± ■ ■ ■ x Pn ) = Pal(w). 
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So now, using formula (|4.ip . we have Pal{wv) = fj, w (Pal(v))Pal(w); in particular, |u n+ i| — \u Pn \ = 
\Pal(wv)\ — \Pal(w)\ = \(jL w (Pal(y))\. Thus, Lemma [4. 151 tells us that for q G Q n , there exist words 
w, v such that q E F(Pal(wv)) and \q\ > \/i w (Pal(v))\. 

Conversely, assume A = wvy with Alph(w) = Alph(vy). Let vi, Vi be such that v = v\V2 with 
V2 the smallest suffix of v such that Alph(v2) = Alph(t>2y). Let n = \wv\. It follows from the 
choice of vi that p n = \wvi\+l. By Theorem 14. 10|. Pal{v2) is a quasiperiod of the epistandard 
word directed by i^y- By Fact 14.11 fJ, WVl (Pal(v^) is a quasiperiod of s. Since any occurrence of 
/J«wvi(Pal(v2)) in s is followed by Pal(wvi), we deduce that any factor of fi WVl (Pal(v2))Pal(wvi) 
of length greater than \/J, WVl (Pal (v2))\ is an ultimate quasiperiod of t. Using formula (14, ip . we 
see that Pal(wv\) = fi w (Pal(vi))Pal(w) so that fJ. WVl (Pal(v2)) = fJ, w (/j, vl (Pal(v2))Pal(vi)) = 
fi w (Pal(viV2)) = fj, w (Pal(v)). Hence any factor of fj, w (Pal(v))Pal(w) = Pal(wv) of length greater 
than \/jL w (Pal(v))\ (> \fx WVl (Pal(v2))\) 1S an ultimate quasiperiod of t. □ 

4.6 Quasiperiods of episturmian words 

We are now ready to state the main theorem of this section, which describes all of the quasiperiods 
of an episturmian word. 

Theorem 4.19. The set of quasiperiods of an episturmian word t is the set of all words 

Hu,(Pal(v))p, with p a prefix of S^ 1 Pal (w), (4-6) 

where w, v are L-spinned words such that t is directed by wvy for some spinned version w of w 
and some spinned version y of an L-spinned infinite word y with Alph(u) = Alph(uy). 

Moreover, the smallest quasiperiod oft is the word [iw(Pal(v)) where wv is of minimal length 
for the property Alph(uw) = Alph(u>z;y), and amongst all decompositions of wv into w and v, the 
word v is the shortest suffix of wv such that Alph(v) = Alph(vy). 

The above theorem shows that if there do not exist words w, y, y, and v (as defined above) 
such that t is directed by wvy with Alph(u) = Alph(vy), then t does not have any quasiperiods, 
and hence t is non-quasiperiodic. For instance, any regular wavy word A (recall the definition from 
Section 13, 4p clearly directs an episturmian word with no quasiperiods since A is the only directive 
word for t and it does not contain an L-spinned factor v containing all letters that follow it in A. 
For example, (abc)^ directs a non-quasiperiodic episturmian word in the subshift of the Tribonacci 
word. 

Let us now illustrate the last part of the theorem. For the epistandard word directed by A = 
ca(ab) UJ , i.e., the image of the Fibonacci word by the morphism L c L a , the shortest word wv such 
that A = wvy for some infinite word y with Alph(uw) = Alph(uwy) is the word caab. There are 
three ways to decompose this word caab into wv: 1) w = e and v = caab; 2) w = c and v = aab; 
3) w = ca and v = ab. The corresponding quasiperiods of the form fj, w (Pal(v)) are respectively: 
1) Pal(caab) = cacacbcacac; 2) fx c (Pal(aab)) = cacacbcaca; 3) fj, ca (Pal(ab)) = cacacbca. 

Proof of Theorem \4-lS\ Corollary 14.131 proves that any word of the form (|4.6p is a quasiperiod of 
t. 

Now suppose that q is a quasiperiod of t. We show that t has at least one directive word of 
the form wvy where w, y are spinned versions of some L-spinned words w, y, and where v is an 
L-spinned word such that Alph(-u) = Alph(uy). Moreover, we show that q = /j,^(Pal(v))p for some 
prefix p of Sr 1 Pal(w). 

Let s be an epistandard word with L-spinned directive word A such that t is directed by a 
spinned version of A. The quasiperiod q is an ultimate quasiperiod of t that occurs as a prefix of 
t. So, by Lemma f4. 181 

q S F(Pal(wv)), with \q\ > \fi w (Pal(v))\, 
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for some L-spinned words w, v, and y such that A = wvy with Alph(u) = Alph(fy). In 
particular, we have Pal(wv) = fqg for some words /, g with |/| + \g\ < \Pal(w)\, where 
Pal(wv) = /i w (Pal(v))Pal(w). By definition of the Pal function, Pal(w) is a prefix of Pal{wv). 
Consequently / is a prefix of Pal(w). 
Note that 

q = r^wiPaKv^Pa^g- 1 . (4.7) 

Thus q = (f~ 1 fj lw (Pal(v))f)p with p := f~ 1 Pal(w)g~ 1 a prefix of f~ l Pal(w). 

The following result shows that f = Sw for some spinned version w of w. 

Lemma 4.20. Given a word w and a prefix f of Pal (w), there exists a spinned version w of w 
such that f = S^,. 

Proof. The proof proceeds by induction on \w\. The lemma is clearly true for \w\ = since in this 
case / is a prefix of Pal(w) = Pal(e) = e, and hence / = e = S £ . 

Now suppose \w\ > 1 and let us write w = xw' where x is a letter. Since / is a prefix of 
Pal(w) = Pal(xw') = [i x (Pal(w'))x (see formula (|4.ip ). we have / = [i x (f) or / = /j, x (f')x for 
some prefix /' of Pal{w'). Moreover, by the induction hypothesis, /' = S^i for some spinned version 
w' of w'. Hence, using formula (|4.3p . we have / = ^(SV) = S x ^,i or / = fi x (S^)x = S x ^. 
That is, f = Sw for some spinned version it? of w = xw' . □ 

Now we have to prove that t is directed by a spinned infinite word beginning with wv. For this 
we need some further intermediate results, as follows. 

Lemma 4.21. For any word u containing at least two different letters and for any other word w, 
there exists a word u w containing at least two different letters such that [J, w (u) = Pal(w)u w . 

Proof. The proof proceeds by induction on \w\. The lemma is trivially true for \w\ = 0. Now 
suppose \w\ > 1 and let us write w = xw' where x is a letter. Then (jl w {u) = fJ. X w'(u) = fi x (fi w '(u)) 
where, by the induction hypothesis, fi w i(u) = Pal(w')u w > for some word u w * containing at least 
two different letters. Hence, n w (u) = fi x (Pal(w')u w >) = (j, x (Pal(w'))fj, x (u w >), and therefore by 
formula (|4TT|) (u) = Pal(xw')x l ^ x {u w i) where the word x contains at least 

two different letters. This completes the proof of the lemma. □ 

Corollary 4.22. For any letter x and for any words v, w such that v contains at least one letter 
different from x, 

\n w {Pal(vx))\ > \Pal{wv)\ + 2. (4.8) 

Proof. We distinguish two cases: x Alph(u), x € Alph(u). If x ^ Alph(u), then Pal(vx) = 
Pal(v)xPal(v) and the word u = xPal{v) contains at least two different letters. On the other hand, 
if x G Alph(u), then v = V\XV2 where V2 is x-free, in which case Pal(vx) = Pal{v)Pal(v\)~ l Pal(v) 
and the word u = Pal{v\)~ l Pal(v) contains at least two different letters. Hence, in either case, 
Pal(vx) = Pal(v)u where u is a word containing at least two different letters. Thus, it follows from 
Lemma 14.211 that /j, w (Pal(vx)) = fi w (Pal(v))Pal(w)u w for some word u w containing at least two 
different letters. Since Pal{wv) = ^ w {Pal{v))Pal{w) and \u w \ > 2, the proof is thus complete. □ 

Note. Inequality (|4.8p is not true in general. For instance, when v = e and w = xx, we have 
/j, w (Pal(vx)) = x and Pal(wv) = xx. 

Let us come back to our proof of Theorem 14.191 Writing v = v'x, we have \q\ > \/j, w (Pal(v))\ > 
\Pal(wv')\. Hence as an immediate consequence of Corollary 14.22^ when v contains at least two 
different letters, Pal{wv) is the smallest palindromic prefix of s of which q is a factor. Therefore by 
Theorem 14.71 and Lemma 14.81 the returns to q in t are the words f~ l ^wv{d)f where a E Alph(y). 
Consequently, each occurrence of q is preceded by the word /. Thus, the set of factors of ft is 
exactly the same as the set of factors of t; whence, the infinite word ft (which is clearly recurrent) 
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is episturmian. Moreover, returns to fq in ft are of the form jJLyjv (a)f for letters a G Alph(y). 
Hence we deduce that there exists an infinite word t' such that ft = fi wv (t'). Moreover, we deduce 
from the following lemma that t' is episturmian. 

Lemma 4.23. For any letter a, an infinite word w is episturmian if and only if /x a (w) is epistur- 
mian. 

Proof. (=>): Immediately follows from Theorem l2 , II (see also Corollary 3.12 in [23] which shows more 
generally that if w is episturmian and tp is an episturmian morphism, then <£>(w) is episturmian). 

(<=): Conversely, suppose z := /i a (w) is an episturmian word. Then z begins with the letter a. 
Hence, by Proposition 14.91 z is directed by a spinned infinite word A beginning with a, say A = ay 
for some spinned infinite word y. So by Theorem 12.11 and Remark 12.21 z = fJ-ai 2 ') where z' is an 
episturmian word directed by y. By the injectivity of fj, a , z' = w; whence w is episturmian. □ 

Now, since / = we have t = ^ v {t') by formula (j4.4jl . and so t is directed by a spinned 
infinite word beginning with wv. 

It remains to consider the case when v is a power of a letter x. In this case, the condition 
Alph(u) = Alph(i;y) implies that y = thus s (and hence t) is periodic. More precisely, 
s = {n w {x)Y and q = f~ 1 {J- w {x)f- Since / = S^, it follows from formula (14. 3j) that q = /^(x) and 
t = q u showing that t is directed by wx^. 

The proof of Theorem 14.191 is thus complete, except for the last part concerning the smallest 
quasiperiod which we prove below. 

As previously, let s be an epistandard word such that t belongs to the subshift of s and let A be 
the L-spinned directive word of s. First of all, we observe that the smallest quasiperiod of t is of the 
form q := fj,^,(Pal(v)), where w is a spinned version of a word w such that A = wvy and Alph(i>) = 
Alph(uy). Assume first that v has a proper suffix s such that Alph(s) = Alph(sy), so that writing 
v = ps for a non-empty wordp, n^ p (Pal(s)) is also a quasiperiod oft. By formula (|4.ip . /j, p (Pal(s)) 
is a proper prefix of Pal(ps) = Pal{v). Thus fj,^ p (Pal(s)) is a proper prefix of fiw(Pal(v)) showing 
that the second of these words is not the smallest quasiperiod of t. 

From now on, we consider L-spinned words wx, u>2, v\, V2, spinned versions t&i and W2 of w\ and 
W2 respectively, spinned words yi and y2 such that n^, l (Pal(vi)) and ^ 2 {Pal{v2)) are quasiperiods 
of t, A = w\V\yi = W2V2Y2, Alph(ui) = Alph(uiyi), and Alph^) = Alph^y^)- Moreover we 
assume that v\ and V2 verify the following hypothesis: 

(H) for i = 1, 2, Vi has no proper suffix Sj with Alph(sj) = Alph(sjyj) 

Note that \n^ 1 {Pal{vi))\ = \/j, Wl (Pal(vi))\ and \^ 2 (Pal(v2))\ = \^ W2 {Pal{v2))\, so that to deter- 
mine whether fi^ 1 (Pal(vi)) or ^ 2 {Pal{v2)) is the smallest quasiperiod, we just have to determine 
which of the two words fi Wl (Pal(vi)) and fi W2 (Pal(v2)) is the shortest word. But then we can use 
the fact that wiv\ and W2V2 are both prefixes of A. When \w\\ = \w2\, Hypothesis (H) implies that 
u>i = W2 and v\ = V2- 

Before considering the case where | < |u>2^2|) let us recall that if a is a letter and if x 
is an a-free word, then /j, x (a) = Pal(x)a (see formula (|4.ip ). Moreover, we observe that for any 
word x and distinct letters a and 0, Pal(xa) is a prefix of fi x (Pal(a(3)). Indeed, fi x (Pal(a(3)) = 
^xdj'a(P)ct) = (i xa ((3a) and this word contains Pal(xa) as a prefix by Lemma T4.2H 

Assume < |u>2^2| (the case > 1^2^21 is symmetric) and let us show that 

\/j,^ 1 (Pal(vi))\ is less than (or equal to) \/j,^, 2 (Pal(v2))\- In this case wivi is a proper prefix of 
^2^2. Let x be the non-empty word such that w\Vix = u>2t>2- Hypothesis (H) implies that v\ 
cannot be a factor of t>2 except as a prefix. Thus there exists a possibly empty word y such that 
v\x = yv2- Assume x has length at least 2 and is a suffix of V2- Then Hypothesis (H) implies that 
the two last letters of V2 and so of x are different. Let a and (5 be the two last letters of x, and 
let x' be the word such that x = x'a[3. Then Pal{v\) is a prefix of Pal{v\x') which is a prefix of 
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by Lemma [4.2H Finally we can see that n VlX i(af3) is a prefix of /i y (Pal(v 2 )). This shows 
that Pal(v\) is a proper prefix of /j, y (Pal(v 2 )) and so n^, 1 (Pal(vi)) is a shorter quasiperiod than 
HHj 2 (Pal{v 2 )). 

Now consider |x| = 1 and \v 2 \ > 2. Let a be the last letter of v\ and let v[ and ^ be the 
words such that v\ = v[a, and v 2 = v' 2 ax. Then Pal(v\) = Pal(v[a) is a proper prefix of 
/V (Pal(ax)) = fj, yv i (Pal(ax)) (see the paragraph before last, taking a/3 = ax), which is a prefix 
of /i y (Pal(v 2 )). So again we find that fi^^Pal^vi)) is a shorter quasiperiod than fiit, 2 (Pal(v 2 )). 

Now we come to the case when v 2 = a for a letter a. If x contains a letter different from a, then 
Pal(vi) is a proper prefix of fj, vi (fi x (a)) = fi y (a) = fi y (Pal(v 2 )). We still conclude as previously. 

Lastly, assume that x is a power of a. Hypothesis (H) implies that a is the first letter of v\ and 
more precisely v\ = av[ with v[ an a-free word. Thus Pal{v\) = n a (Pal(v[))a = /x a (Pa/(f 1 )a) = 
MaC/V ( a )) = /^oti^ ) = l^viix) = fi y (v 2 ). So in this particular case ^^{Palivi)) = fi^, 2 (Pal(v 2 )), 
and once again n^ 1 (Pal(vi)) is the smallest quasiperiod. □ 

Remark 4.24. Theorem 14.191 shows in particular that if an episturmian word t is directed by a 
spinned word A with all spins ultimately L, then t has infinitely many quasiperiods since there are 
infinitely many factorizations of A into the given form (i.e., any positive shift of an epistandard 
word is quasiperiodic). 

Let us demonstrate Theorem 14.191 with some examples. First we provide an example of an 
episturmian word having infinitely many quasiperiods, but which is not epistandard and all of its 
directive words are wavy (recall that a wavy word is a spinned infinite word containing infinitely 
many L-spinned letters and infinitely many P-spinned letters). 

Example 4.25. Consider the episturmian word t with normalized directive word A = abc(ba) u . 
From Theorem 13.21 we observe that t has infinitely many directive words: 

A = abc(b(£f and A, = abc(bayba(ba) u for each i > 0, 

all of which are wavy. Hence, by Theorem 14.191 the set of quasiperiods of t consists of Pal(abc) = 
abacaba and, for each i > 0, the words: 

(j,tf,(bab) and fi^(bab)a where w = abc(ba) 1 . 

Note that Pal{ba) = bab and SZ l Pal(w) = {Pal{w)a- 1 y 1 Pal{w) = a. 

Next we give some examples of quasiperiodic episturmian words having only finitely many 
quasiperiods. 

Example 4.26. Consider the quasiperiodic episturmian word fJ- a dbcd( ar ) w here r is the Tribonacci 
word; it has only two directive words: adbcdfabc)^ and adbcdfabc)^ (the first one being its 
normalized directive word). So we see that t has only one quasiperiod, namely Pal(adbc) = 
adabadacadabada. Similarly, the episturmian word /x c feaa(cr), which is directed by exactly three dif- 
ferent spinned infinite words [cbaa{abc) u , cbada(bca) w , and cbdaaibca) 1 ^) , has only two quasiperiods: 
Pal(cba) = cbcacbc and Pal(cbaa) = cbcacbcacbc. 

We can also construct episturmian words having exactly k quasiperiods for any fixed integer 
k > 1, as shown by the following example. 

Example 4.27. Consider the episturmian word with normalized directive word A = abcd^ca)^ . 
By Theorem 13.21 t has exactly two directive words: A and abcalpca)^ . Hence, by Theorem I4.19| t 
has only one quasiperiod, namely Pal{abc). Now, to construct an episturmian word having exactly 
k quasiperiods for a fixed integer k > 1, we consider wA = wabcdipca)^ where w is a spinned 
version of any L-spinned word w with Alph(w) n {a,b,c} = and \Pal(w)\ = k — 1. Then, by 
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Theorem 14.191 the set of quasiperiods of the episturmian word directed by wA consists of the k 
words: 

^{Pal{abc))p where p is a prefix of Pal(w). 

For example, d k ~ 1 A = d k ~ 1 abca(bca) UJ directs an episturmian word with exactly k quasiperiods: 
^^(Pa^abc^p = d k ~ 1 ad k ~ 1 bd k ~ 1 ad k ~ 1 cd k ~ 1 ad k ~ 1 bd k ~ 1 ap where p is a prefix of d k ~ x . 

4.7 Characterizations of quasiperiodic episturmian words 

From Theorem 14. 191 we immediately obtain the following characterization of quasiperiodic epistur- 
mian words. 

Theorem 4.28. An episturmian word is quasiperiodic if and only if there exists a spinned word 
w, an L-spinned word v and a spinned version y of an L-spinned word y such that t is directed by 
wvy with Alph(u) = Alph(uy). 

The above characterization is clearly not useful when one wants to decide whether or not a 
given episturmian word is quasiperiodic. In this regard, our normalized directive word plays an 
important role as it provides a more effective way to decide. 

Theorem 4.29. An episturmian word t is quasiperiodic if and only if the (unique) normalized 
directive word oft takes the form wavidv2d ■ ■ ■ v^dvy for some L-spinned letter a, spinned word w, 
a-free L-spinned words v, v\, . . . , Vk (k > 0), and a spinned version y of an L-spinned word y 
such that Alph(aw) = Alph(auy). 

Proof. First assume that an episturmian word t is directed by wavidV2d ■ ■ ■ Vkdvy (as in the hy- 
potheses). Then by Theorem 13.11 t is also directed by ivdvid^a ■ ■ ■ Vkavy. By Theorem 14.281 t is 
quasiperiodic. 

Now assume that t is quasiperiodic. By Theorem 14.281 one of its directive words is iviv'y for 
spinned words wi, y and an L-spinned word v' with Alph(V) = Alph(w'y). If v' contains only 
one letter a, y = a w and the normalized directive word of t ends with aa w , then the condition is 
verified. 

Now assume that v' contains at least two (different) letters: let us write v' = av. Without loss 
of generality, we can assume that y and w\ are normalized. If w\ has no suffix in a A* , then 

w\avy is normalized and of the required form (with k = 0). Otherwise w\ = waviav~2d ■ ■ ■ Vj~_ for 
some spinned words w, y and some a-free L-spinned words v i, . . . , Vk (k > 0) such that w has no 
suffix in dA* . Then the normalized directive word of t is wav\dV2d ■ ■ ■ vtdvy. □ 

Example 4.30. The episturmian words with directive words (a6c)^, (abcdbc) 1 ^ or (abdcadbcbcb)^ 
are not quasiperiodic whereas the one with normalized directive word abdbcc^dbc)^ is quasiperiodic 
(it is also directed by dbabcc(dbc) u ') . 

Remark 4.31. It follows from Theorem 14.281 that any quasiperiodic episturmian word has at least 
two directive words. Indeed, if t is a quasiperiodic episturmian word, then t has a directive word of 
the form A = wvy where the words w, y are spinned versions of some L-spinned words w, y and v 
is an L-spinned word such that Alph(u) = Alph(-uy). If y (and hence A) has all spins ultimately L, 
then item [2] in part Hi) of Theorem 13.21 shows that t also has a wavy directive word. Now suppose 
y does not have all spins ultimately L. Then y must be wavy, and hence contains infinitely many 
i?-spinned letters. Choose x to be the left-most i?-spinned letter in y. Then y begins with ux 
for some L-spinned word u (possibly empty). Hence, since x occurs in v, we see that A = wvy 
contains the factor vux = v'xv"ux. Thus A does not satisfy the conditions of Theorem 13.71 as it 
contains a factor in xA*x, and therefore t does not have a unique directive word. Moreover, we 
easily deduce from Theorems 14.281 and 13.71 that any episturmian word having a unique directive 
word is necessarily non-quasiperiodic. 
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In view of the above remark, one might suspect that an episturmian word is non-quasiperiodic 
if and only if it has a unique directive word. But this is not true. For example, both ba^bca) 1 ^ and 
babfcab) 1 ^ direct the same non-quasiperiodic episturmian word t by Theorems 13.21 and 14.281 These 
two spinned infinite words are the only directive words for t, which might lead one to guess that an 
episturmian word is non-quasiperiodic if and only if it has finitely many directive words. But again, 
this is not true. For example, as stated in Example 14.261 adbcd(abc) ul and adbcd^abc)^ are the only 
two directive words of the quasiperiodic episturmian word ^ a dbcd( ar ) where r is the Tribonacci word. 
Moreover there exist non-quasiperiodic episturmian words, such as the one directed by (abbc)^ , that 
have infinitely many directive words (the words (abbc) n abc{abbc) u are pairwise different directive 
words). 

Nevertheless, in the Sturmian case we have: 

Proposition 4.32. A Sturmian word is non-quasiperiodic if and only if it has a unique directive 
word. 

Proof. First let us suppose by way of contradiction that t is a non-quasiperiodic Sturmian word, 
but has more than one directive word. Then, by Proposition 13.81 t has infinitely many directive 
words. Moreover, as t is aperiodic, all of the directive words of t are spinned versions of the same 
A £ {a, b}^ \ {A*a w U A*b u ) by Fact 13.31 and none of these directive words are regular wavy. Hence 
the normalized directive word of s contains ab or ba. But then t is quasiperiodic by Theorem 14.281 
a contradiction. 

Conversely, by Proposition 13.81 if a Sturmian word t has a unique directive word, then its 
(normalized) directive word is regular wavy. Such a word clearly does not fulfill Theorem 14.281 and 
so t is not quasiperiodic. □ 

Let us recall that a Sturmian word is non-quasiperiodic if and only if it is a Lyndon word |27j . 
The generalization of this aspect will be discussed in Section [6l 

5 Quasiperiodicity and episturmian morphisms 

In this section we draw connections between the results of the previous section and strongly 
quasiperiodic morphisms. As in [29], a morphism / on A is called strongly quasiperiodic (on A) if 
for any (possibly non-quasiperiodic) infinite word w, /(w) is quasiperiodic. 

5.1 Strongly quasiperiodic epistandard morphisms 

Quasiperiodicity of epistandard words can also be explained by the strong quasiperiodicity of epi- 
standard morphisms. 

Theorem 5.1. Let v be a word over an alphabet A containing at least two letters. The epistandard 
morphism /j> v is strongly quasiperiodic if and only i/Alph(w) = A. Moreover Pal (v) is a quasiperiod 
of /z„( w ) for any infinite word w. 

To prove this result, which is a direct consequence of Lemma 15.21 (below), we need to consider 
infinite words covered by several words. We say that a set X of words covers an infinite word w if 
and only if there exist two sequences of words (p n )n>o and (z n ) n >o such that, for all n > 0, p n z n 
is a prefix of w, z n S X, pq = e and \p n \ < \p n +i\ < \PnZ n \- The last inequalities mean that p n is a 
prefix of p n +i which is a prefix of p n z n , itself a prefix of w. Once again the sequence (p n z n )n>o is 
called a covering sequence of prefixes of the word w. Observe that a word is covered by X if and 
only if it is covered by X U {e}. 

Lemma 5.2. For w an infinite word over A, y a letter, X a subset of A and u a word, if w is 
covered by {u} U {ux \ x G X U {y}} then L y (w) is covered by {L y (u)y} U {L y (u)yx | x G X \ {y}}. 
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Proof. If y G X , then L y (w) is covered by {L y (u)} U {L y (u)y} U {L y (u)yx | x G X \ {y}}, and 
each occurrence of L y (u) is followed by the letter y. So L y (w) is covered by {L y (u)y} U {L y (u)yx | 
x G X \ {y}}- If y $ X, thus X = X \ {y} and then L y (w) is covered by {.^(u)} U {L y (u)yx \ 
x G X}, and similarly each occurrence of L y (u) is followed by the letter y. So L y (w) is covered by 
{L,,(u)y} U {L y (u)yx | x G X}. □ 



Proof of Theorem \5.1\ Let t> = Ui • • • v\ w \ be a word (with each Vi a letter) such that Alph(u) = A. 
Let itui = e and, for any i from \v\ to 1, let = L Vi (in)vi: observe that Uj_i = Pal{vi ■ ■ -v\ v \). 
It is immediate that any infinite word w is covered by the set A, which can be expressed as 
A = {e} U {ex | x G .A} = {itu} U {itux | x G *4}. 

By induction using Lemma 15.21 we can state that, for 1 < i < \v\, (j, Vi ... v ,,(yr) is covered by 
{ui-i} U {tij_ix | x G A \ {vi, . . . ,v\ v \}}. So, since uq = Pal(v), /i«(w) is covered by {Pal(v)} U 
{PaZ(v)x |iGi\ {vi, . . . , v\ v \}}. Therefore, since Alph(u) = A, ^ v {w) is covered by Pal(v), that 
is, w is Pal(v )-quasiperiodic. Hence \i v is strongly quasiperiodic. 

Now let us consider the case where Alph(-u) ^ A. First suppose that v = e. Then, since A 
contains at least two letters (and so there exists at least one non-quasiperiodic word over A), the 
morphism fi v , which is the identity, is not strongly quasiperiodic. Assume now that n/e. Let a 
be a letter in A \ Alph(u) and let b be a letter in Alph(u). The word ^(abP) contains only one 
occurrence of the letter a, and so it is non-quasiperiodic. Thus the morphism fj, v is not strongly 
quasiperiodic. □ 

Remark 5.3. Note that any infinite word A can be written A = vy with Alph(u) = Alph(vy). So 
if t (resp. t' ) is the epistandard word directed by A (resp. y), then t = fi v (t') and t is quasiperiodic 
by Theorem I5.ll The same approach allows us to show that if an episturmian word is directed by 
a spinned infinite word with all spins ultimately L, then it is quasiperiodic, as deduced previously 
(see Remark I4.12P . Note also that Corollary 14.131 is a direct consequence of Theorem 15.11 



5.2 Useful lemmas 

We now introduce useful material for a second proof of Theorem 14.191 using morphisms (see Sec- 
tion 15 .3|) . The next results (that maybe some readers will read when necessary to understand the 
proof of Theorem I4,19j) show how to obtain some quasiperiods (or a covering set) of a word of the 
form L x (w) or R x (w) for an infinite word w (not necessarily episturmian). 

Lemma 5.4. (converse of Lemma l5.2p 

For an infinite word w over A, y a letter, X a subset of A and u a word, if L y (~w) is covered by 
{L y (u)y} U {L y (u)yx \ x G X \ {y}}, then w is covered by {u} U {ux \ x G X U {y}}. 

Proof. By hypothesis, there exist (p n )n>(b (un)n>o such that po = e, u n G {L y (u)y} U {L y (u)yx \ 
x G X \ {y}} and for all n > 0, both \p n \ < |p n +i| < |Pn^n| and p n u n is a prefix of w. 
For all n > 0, p n L y (u)y is a prefix of L y (w) so there exists p' n such that p n = L y (p' n ). 
We consider the following two complementary cases: 

1. Case u n = L y (u)y. 

The word p n L y (u)y = L y (p' n u)y is a prefix of L y (w) so p' n is a prefix of w. For the prefixes 
Pn+i and p n u n of w, we have |p n +i| < bn^nl) thus p n +i is a prefix of p n Un- 

If \p n +i\ < \p n u n \, let u' n = u: p n+1 = L y (p' n+1 ) is a prefix of p n L y (u) = L y (p' n u). Moreover 
Pn+iy and p n L y (u)y are prefixes of L y (w). So p' n+1 is a prefix of p' n u which itself is a prefix 
of w. 

If \p n +i\ = \p n u n \, then p n+1 = p n u n and from p n+1 u n+ i prefix of w, we know that p n +W is 
a prefix of w, hence p' n uy = p' n+ i is a prefix of w. In this case let u' n = uy. 
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(u)yx for some x G X \ {y}. In this case (as previously), we can see that with 
u' n = ux, p' n+ i is a prefix of p' n u' n itself a prefix of w. 

We have proved that the sequence (p' n u' n ) n >o is a covering sequence of prefixes of w. Thus the 
word w is covered by {u} U {ux \ x € X U {y}}. □ 

Lemma 5.5. (generalization of [29} Lem. 5.5]) 

Let w be an infinite word starting with a letter x. For any letter y ^ x, the word R y (w) is 
quasiperiodic if and only if w is quasiperiodic. 

Moreover if q is a quasiperiod of R y (w) then two cases are possible: 

1. q = R y (q') with q' a quasiperiod o/w; 

2. qy = Ry(q'z) with z ^ a a letter such that both q' and q'z are quasiperiods o/w. 

Proof. Assume first that w is quasiperiodic. By Fact 14. ll R y (w) is quasiperiodic. More precisely, 
if v is a quasiperiod of w, then R y (v) is a quasiperiod of R y (\v). 

Assume now that R y (w) is quasiperiodic and let q be one of its quasiperiods. By hypothesis, 
we know that q starts with x (as w). 

A first case is that q = R y (v) (for a word v) which is equivalent to the fact that q ends with the 
letter y. Let (p n )n>o be a sequence of words such that (p n q)n>o is a covering sequence of prefixes 
of R y (w). Let n > 0. From the fact that q starts with x, we deduce that there exists a word 
p' n such that p n = R y (p' n ). Then we can see that p' n v is a prefix of w. Moreover the inequality 
\Pn\ < \Pn+i\ < \PnQ\ implies \p' n \ < \p' n+1 \ < \p' n v\. So {p f n v) n >o is a covering sequence of prefixes 
of w: v is a quasiperiod of w. 

Now assume that q ends with a letter z different from y. Then q = R y (v)z for a word v and 
each occurrence of q is followed by the letter y. Let (p n ) n >o be a sequence of words such that 
(Pn,q)n>o is a covering sequence of prefixes of R y (w). Let n > 0. From the fact that q starts with 
x, we deduce that there exists a word p' n such that p n = R y {p' n ). Then we can see that p' n v is 
a prefix of w. Now since q does not end with y, the inequality \p n \ < |p n +i| < \PnQ\ is actually 
\Pn\ < |Pn+l| < \PnR y {v)\. And so (p' n v) n >o is a covering sequence of prefixes of w; thus v is a 
quasiperiod of w. Since each occurrence of R y (v) is followed by y, (p' n vz) n >o is also a covering 
sequence of prefixes of w; whence vz is also a quasiperiod of w. □ 

Lemma 5.6. Let x,y be two different letters, let w be an infinite word, and let u be a finite word 
such that L x (w) is uy- quasiperiodic. Then there exists a word v such that w is vy- quasiperiodic, 
uy = L x (vy), and \uy\ > \vy\. 

Proof. Let x, y, u, w be as in the lemma. Since uy is a prefix of L x {w) and since x ^ y, there exists 
a word v such that uy = L x {yy). We have \uy\ > \vy\. 

Let (p n )n>o be such that (p n uy) n >o is a covering sequence of prefixes of L x (w). Since uy is a 
prefix of L x (w), x is the first letter of u, and so from p n x prefix of L x (w), there exists for any 
n>0a word p' n such that p n = L x {p' n ). From the prefix L x (p' n vy) of L x (w), we deduce that p' n vy 
is a prefix of w. The inequality \p n \ < \p n +i\ < \p n uy\, that is \L x (p' n )\ < \L x {p' n+l )\ < \L x (p' n vy)\, 
implies that \p' n \ < \p' n+ i\ < \p' n vy\. Hence vy covers w. □ 

Remark 5.7. Lemmas 15.51 and 15.61 are not true when x = y. For instance ab u is not quasiperiodic 
whereas i? a (a6^) = afia)^ is quasiperiodic. Moreover the word L a (6 a; ) = {ab) w is aoa-quasiperiodic 
whereas b u has no quasiperiod ending with a. 

Lemma 5.8. Suppose t is an episturmian word starting with a letter z. Let x be a letter different 
from z and let u be a non-empty word. Then there exists a set B of letters such that the word R x {t) 
is covered by {u} U {u}B if and only if this word is covered by {u, ux}. 
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Proof. The if part is immediate. Assume that R x (t) is covered by {u} U {u}B. There is nothing to 
prove if B C {x}; hence we assume that B contains a letter y different from x. Since t starts with 
z and z ^ x, the word R x (t) and its prefix u start with z. If the word uy is a factor of R x (t), the 
word u ends with the letter x. Let (p n )n>0 and (u n ) n >o such that (p n Un)n>o is a covering sequence 
of prefixes of R x (t) and for all n > 0, u n € {u} U {u}£>. Let n be any integer such that u n = uy. 
Since y ^ x and z/i, the word is not a factor of i? x (t). Since < |Pn^n|, since p n +i 

and p n tt n are prefixes of Rx(t), and since u n +i starts with z, necessarily |p n +i| < |iV^n| = \PnUy\- 
We deduce from what precedes that R x (t) is covered by {x} U {u}(B \ {y}). Hence considering 
successively all letters of B different from x, it follows that R x (t) is covered by {u, ttx}. □ 

Lemma 5.9. Let t be an episturmian word starting with a letter z. Let x be a letter different from 
z and let q be a word. If the word R x (t) is covered by {q,qx} then L x (t) is covered by xq. 

Proof. If q is empty, the result is immediate since in this case t = x u . If q ends with a letter 
different from x, then each occurrence of q is followed by an x and so R x (t) is covered by qx. Hence 
we can assume without loss of generality that q ends with x. Then q = R x (v) and qx = R x {yx) for 
some word v. By the same technique used in the proof of Lemma 15.51 we can see that t is covered 
by {v, vx}. This implies that L x (t) is covered by {L x (v), L x (v)x} with all the occurrences of L x (v) 
followed by an x. So L x (t) is covered by L x (v)x. It is well-known that L x {v)x = xR x {v) = xq. □ 

Lemma 5.10. Suppose t is an infinite word (not necessarily episturmian) covered by the set {q,qa} 
and by the set {q, qc} for some word q and two different letters a and c. Then t is q-quasiper iodic. 

Proof. Let {p n )n>o be prefixes of t and (q n )n>o be words belonging to {q, qc} such that (p n q n ) n >o 
is a covering sequence of t. Moreover let us assume that there is no word p except the elements 
of (p n )n>o such that pq is a prefix of t. Let n be any integer such that q n = qc. Since t is also 
covered by {q, qa} and since all the words p such that pq is a prefix of t belong to {p n \ n > 0}, we 
necessarily have < \pi\ (that is 7^ \PnQc\). The sequence (p n q) is a covering sequence 

of t; hence t is g-quasiperiodic. □ 

5.3 A second proof of Theorem 14.191 

We now give a second proof of Theorem 14.191 which provides further insight into the connection 
between quasiperiodicity and morphisms. 

For any spinned word w, let us denote by q^ the word SZ 1 Pal(w) appearing in Theorem 14.191 
Then, for any spinned word v and letter a, we have: 

• Qe = e; 

• Qav = L a (q$)a; 

• qav = Ra{lv)- 

Indeed by formula (|4.3p . we have S a v = L a (Sv) and Sati = L a (Sy)a = aR a (Sv). Hence q a n = 
S~$L a (Pal(v))a = (L a {Sij))~ l L a (Pal{v))a = L a (S~ l Pal{v))a = L a (q ii )a and, since L a (Pal(v))a = 
aR a (Pal(v)), q- a v = S^(aR a (Pal(v))) = (ai? a ( < S,))- 1 (aii a (Pa/(t;))) = R a (S7\Pal(v)). 

These formulae, which define recursively the word q$, will be helpful in the following proof. 

First assume that t is an episturmian word directed by ivvy where w is a spinned word, 
y is a spinned version of an L-spinned infinite word y and v is an L-spinned word such that 
Alph(-u) = Alph(uy). Then, by Theorem 15. 1\ Pal(v) is a quasiperiod of /j, v (t') where t' is 
the episturmian word directed by y. Now we prove by induction on \w\ that fj,^(Pal(v))p is a 
quasiperiod of t for any prefix p of q$. More precisely we prove that fiw(Pal(v)) is a quasiperiod 
of t = Hw(n v (t')) (this is also a consequence of Fact 14. ip and each occurrence of Hw(Pal(v)) in 
t is followed by the word q^, (which is a direct consequence of the above inductive formulae). 
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Observe that the previous fact obviously holds if w = e. When it holds, let (p n ) n >o be a se- 
quence of prefixes such that \p n \ < |p n +l| < \Pn^w{P a K v ))\ an d Pn/J"w(Pal(v))q^, is a prefix of 
t. Then L a (p n/ u lS (PaZ(?;))q , u,) = L a (p n )^(Pal(v))L a (qa) and is followed as a prefix of L a (t) by 
a. Note that the word L a (p n )^ aih (Pal{v))L a (q ii ,)a = L a (p n )^ a ^(Pal(v))q a ^ is a prefix of L a (t). 
Moreover we can verify that |L a (p n )| < \L a (p n+ i)\ < \L a (p n )n a ^,(Pal(v))\ since p n is a prefix of 
p n+ i itself a prefix oip n ^w(Pal{v)). Similarly R a (p n ^(Pal(v))qw) = R a {pn)^aw{Pal{v))R a {qw) = 
R a {Pn)^aw{Pal{v))qau, is a prefix of R a (t) and \R a {p n )\ < \R a (p n +i)\ < \R a {p n )Hau,{Pal{v))\. 

To end the proof of Theorem 14.191 we prove by induction on \q\ that if q is the quasiperiod of 
an episturmian word t then (at least) one directive word of t can be written A = wvy where w is a 
spinned word, y is a spinned version of an L-spinned infinite word y and v is an L-spinned word such 
that Alph(u) = Alph(wy). Moreover we have q — 1^1 it, (Pal(v))p with p a prefix of qm, = S^Pa^w). 

We observe that \q\ > 1 since q covers the infinite word t. 

When |g| = 1, for a letter a, q = a and the word t = a u is directed by a w which is of the form 
wva^ with w = e and v = a. Then q^ = e, and taking p = e, q = ^{Pal{v))p. 

Assume from now on that \q\ > 2. Without loss of generality, we assume that A = x\X2 ■ • • is 
the normalized directive word of t, and let (t^ n ^) n >o be the infinite sequence of episturmian words 
associated to t and A in Theorem 12.11 

Let us first consider: 
Case t = L a (tW) (that is, x\ = a for a letter a). 

Assume that q does not end with the letter a. By Lemma [5.6( q = L a {q{) for a quasiperiod 
qi of such that \q\\ < \q\. By the induction hypothesis, has a directive word of the 
form wvy with u>, v, y as in the theorem, and q\ = n^(Pal(v))p for a prefix p of q^. We have 
q = L a {ii^{Pal{v))p) = (i a u,(Pal(v))L a (p). Observing that t is directed by (aw)vy and that L a (p) 
is a prefix of L a (g^)a = q a ^^ we get the result for q and t. 

Now assume that q ends with the letter a, that is q = L a {q\)a for a word q\. By Lemma l5~4"l 
is covered by the set {qi,qia}. 

Possibly q±a is a quasiperiod of t^. In this case, q = L a (qia). Observe that \q\ = \L a {q\a)\ > 
\qia\. The equality holds only when a is the only letter occurring in q\ so in q. In this case, taking 
w = a' 9 ' -1 and v = a, the word t is directed by wva^ and we have q = aa' 9 ' -1 = Pal{a)q a \ q \-i = 
f' a \i\- 1 (P a K a )) ( lui = L JL w{PO'K v )) c lw'- the induction result holds here. When \q\ = \L a (q\a)\ > \qia\, 
the proof ends, using the induction hypothesis, as in the previous case where q did not end by a 
(the only difference is that q ends with the word L a (p)a but L a {p)a is still a prefix of q a w)- 

It is also possible that q\ is a quasiperiod of t^, but in this case we can once again conclude 
using the induction hypothesis. 

Now we assume that neither q\ nor q\a is a quasiperiod of t^. This implies in particular that 
has a factor q\b for a letter b different from a. So the following property holds for n = 1 (and 
a\ = a): 

Prop(n): t = L ai L a , 2 ■ ■ ■ L an (t^) (that is, ot{ = a^, for 1 < i < n), t( n ) is covered by 
the set {q n } U {q n ai \ i = 1, . . . , n} with q = q, and for i = 1, . . . , n, qi-i = L ai (%)aj. 
Moreover q n a and q n b are both factors of t( n ). 

Let us assume that Property Prop(n) holds for some n > 1 with q n ^ e and t^ n ^ = L an+1 (t( n+1 -*) 
(that is x n+ \ = a n+ i). From a ^ b (so a ^ a n+ \ or b ^ a n+ i), we deduce that q n c is a factor of t( n ) 
for a letter c ^ a n +i and so q n must end with the letter a n +i, that is, q n = L an+1 (q n +i)a n +i for 
some word q n +i- By Lemma |574"1 q n +i is covered by the set {g n+ i} U {q n+ iai \ i = 1, . . . , n, n + 1}. 
Moreover it is quite immediate that q n +ia and q n +\b are both factors of t( n+1 ). Hence Prop(n+l) 
holds. 

Observing that for any i > 1 we have \qi+i\ < \qi\, we deduce that one of the two following cases 
holds: 
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1. There exists an integer n > 1 such that Property Prop(n) holds with q n = e. In this case, we 
verify that q = Pal{a\ ■ ■ ■ a n ). Let w = £, v = a\ ■ ■ ■ a n and let y be any directive word of the 
episturmian word t^ n \ Then t is directed by wvy and q = ^u,(Pal(v)). Since is covered 
by the letters of v, we deduce that Alph(v) = Alph(uy). Hence the induction result holds. 

2. There exists an integer n > 1 and a letter c such that Property Prop(n) holds with q n ^ e 
andtW = i? c ( t(n+1) ) (that is x n+ i = c). By Theorem 13.51 since A is normalized, the word 
t( n + 1 ) does not start with the letter c (otherwise the infinite word x n +ix n +2 • • • starts with 
a factor in cAc). Hence the first letter of q n is not c whereas its last letter is c since a ^ b 
and both q n a and q n b are factors of R c (t ( - n+1 ^). By Lemma |5T8| we deduce that R c (t^ n+1 ^) 
is covered by {q n ,q n c}- Since for i = 1, . . . , re, q\-\ = L ai (qi)ai, we can inductively verify 
that the words (i = n, . . . , 1) are covered by {qi,qic}. In particular tW is covered by 
{liiQic}- By Lemma T5.1CH since we have assumed that t^ 1 ) is not gi-quasiperiodic, we have 
c = a. So i? a (t( n+1 )) is covered by {q n ,qnO'}- By Lemma I5T91 L a (t( n+1 )) is ag n -quasiperiodic. 
Observe that t = L ai L a2 ■ ■ ■ L an R ai (t< n+1 )) = R ai R a2 ■ ■ ■ -R an L a (t (n+1) ), and that \q n \ < \q\. 
By induction hypothesis, L a (t^ n+1 )) is directed by a word w'vy for a finite spinned word 
w', a finite L-spinned word v and a spinned version y of an infinite L-spinned word y such 
that Alph(f) = Alph(uy), and aq n = /i^(Pal(v))p' with p' a prefix of q^i. Thus we have 
proved that the word t is directed by wvy where w = a\a,2 ■ ■ ■ a n w', and jt-taia 2 -o„(<Z a n) = 
fiu,(Pal(v))p' . To end the proof of the current case we have to prove that q = fj,^(Pal(v))p' . 
First assume that <Zj = a for some i between 2 and n. We know that the word is covered 
by {qi,Qia} and t^" 1 ) = L ai (tW) = L a (tW). Hence by Lemma l5T2| t^ -1 ) is covered by 
L a (qi)a = Using Fact 14.11 we can deduce that is gia-quasiperiodic, a contradiction. 
Hence for i = 2, . . . , n, a, ^ a. By induction we can prove that Ha i ai +1 ---a n {( :i 'Qn) = aQi-i for 
i = n + 1, . . . ,2. Indeed if fJ.a l+1 -a n (aq n ) = aqi then fJ-a,a, +1 -a n (aq n ) = Va % { a< li) = Ra,{ a( li) = 
aaiR ai {qi) = aL ai (qi)a i = o%_i. Now fi ai a2 -a n (aqn) = R ai (aqi) = aR a (qi) = L a {qi)a = q. 

Now we come to: 

Case t = #a(tW). By Theorem [33J since A is the normalized directive word of t, the word t does 
not start with the letter a. By Lemma 15.51 the word t*- 1 ) is quasiperiodic and more precisely there 
exists a quasiperiod q\ of such that q = R a (qi) or q = R a {q\)z with z ^ a: in this last case, 
q\z is also a quasiperiod of t^. Since q does not start with a, \q\\ < \q\. By induction hypothesis, 
has a directed word of the form wvy with w, v, y as in the theorem, and q\ = fi^(Pal(v))p 
for a prefix p of g$. When q = R a (qi), since R a (Hw) = Mau> and R a {p) is a prefix of R a {qw) = 
Qaw, Q = fJ-aw(PO'l(v))R a (p) and t (which is directed by awvy) verify the induction result. When 
q = R a (q\)z with z / a, since q%z is also a quasiperiod of t^, we deduce that pz is a prefix of q^j 
and so R a (p)z is a prefix of R a (p)za a prefix of q a w, which allows us to conclude once again that 
q = fj,aw(P a K v ))Ra(p)z and t (which is directed by awvy) verify the induction result. 
This ends the second proof of Theorem 14.191 

5.4 Strongly quasiperiodic episturmian morphisms 

The aim of this section is to characterize all the episturmian morphisms that are strongly quasiperi- 
odic, i.e., the episturmian morphisms that map any infinite word onto a quasiperiodic word. Our 
characterization (Theorem 15. 14[) generalizes Theorem 15.11 to all episturmian morphisms. 

Looking at Theorem 15.11 and Theorem 14.191 one might guess that an episturmian morphism fi% 
is strongly quasiperiodic if and only if u has an L-spinned factor v. We will see that it is not the 
case. In particular, there exist i?-spinned words it such that \iu is strongly quasiperiodic, as shown 
by the following result. 

Lemma 5.11. Let u be a finite spinned word over A U A and let a be a letter in A. If there exist 
L-spinned words v, y and a spinned word w such that u = wavy and Alph(v) = A \ {a}, then for 
any infinite word t, /x&(at) is quasiperiodic. 
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Let us mention that for instance the word abcab verifies the condition of the previous lemma for 
each of the letters a, b and c, so that Habmb * s strongly quasiperiodic over {a,b,c}. 
Lemma 15.111 is a consequence of the following one. 

Lemma 5.12. Suppose a is an L-spinned letter, x is an L-spinned word, v is an a-free L-spinned 
word, and w is an infinite word over jA. Then the words f-iav 

(xa), Haviax), and /j 5i (aw) are 

Pal(av) -quasiperiodic. 
Proof. 

(1) By Theorem 15.11 the word iiavixa?) is Pal (au)-quasiperiodic where Pal(av) = fi av (a) since 
Pal{ava) = ^, av {a)Pal{av) = Pal{av)Pal{av) by formulae (|4.1I) and (|4.5p . Consequently, 
since fi av (xa) ends with /U a „(a), it is Pal{av )-quasiperiodic. 

(2) By formulae (j4.3j) and (|4.2p . /^(xa) = fj, av (xa)Sav and Sav = Pal(av). So 
Pal(av)iiav{ax) = l^av{ax)^ av (a) = fi av (axa) = fi av (a)fi av (xa) = Pal(av)fi av (xa), that is 
Hav(cix) = fi av (xa). It follows from (1) that //^(ax) is Pai(au)-quasiperiodic. 

(3) Prom (2), we deduce that /i 5i j(aw) has infinitely many quasiperiodic prefixes with quasiperiod 
Pal(av). Hence /ias(aw) is PaZ(aw)-quasiperiodic. 

□ 

Proof of Lemma \5.11[ Let u, a, w, v, y be as in the hypotheses of Lemma [5.111 Let t be an infinite 
word over A. The word fiy(at) starts with a. By Lemma \5. 121 ^avi^yi^)) is quasiperiodic and so 
Hu(at) = fj-wavyicit) is quasiperiodic. □ 

The following remark will be useful several times (see the proof of Theorem l4.29l for more details): 

Remark 5.13. An infinite word over A U A has a decomposition wvy with Alph(w) = Alph(uy) 
if and only if its normalized decomposition can be written in the form w'av\av2 ■ ■ ■ av^y' with 
Alph(awj) = Alph(at> jOf j + i • • ■ av^y') for some 1 < i < k. 

Now we state our characterization of strongly quasiperiodic episturmian morphisms. 

Theorem 5.14. Let A be an alphabet containing at least three letters. An episturmian morphism 
is strongly quasiperiodic on A if and only if its normalized directive word u verifies one of the 
following three conditions: 

i) u = wav\av2a ■ ■ ■ v^avy 

where a is a letter of A (with spin L), w, y are spinned words, v\,...,Vk (k > 0) are a-free 
L-spinned words, and v is an L-spinned word such that Alph(af) = Alph(awy). 

ii) For any letter a in A, u = wavy for some spinned word w and L-spinned words v, y such 
that Alph(?;) = A \ {a}. 

Hi) u verifies case ii) for all letters in A except for one letter a £ A such that u = wvay where v 
and y are L-spinned words verifying Alph(?;) = {a}. 

Before proving this theorem, let us observe that we do not include in this result the Sturmian 
case. Indeed, Theorem 15.141 is no longer valid when n < 2. For instance the morphism /i^ 
(a i— > aba, b t— > ba) is strongly quasiperiodic but its normalized directive word fulfills none of the 
above conditions. A complete description of strongly quasiperiodic Sturmian morphisms is provided 
in [29]. 
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Proof. Let u be the normalized directive word (of the morphism 

If u verifies i), then by Theorem 13.11 av\av2a ■ ■ ■ Vf.av = avxav^a ■ ■ ■ v^av. Moreover from Theo- 
rem [^^^ the epistandard morphism fiav is strongly quasiperiodic. Thus \x& is strongly quasiperiodic 
on A. 

If u verifies ii), then by Lemma l5.11l n% is strongly quasiperiodic over A. 

If u verifies Hi), then Lemma 15.111 /x#(t) is quasiperiodic for any word t that does not start 
with a. 

Let us decompose y = yoayi ■ ■ ■ ay^ where k is the number of occurrences of a in y. In the proof 
of Lemma f5 .121 we have seen that \iay k {ax) = fx ayk (xa) for any finite word x. This formula naturally 
extends to any infinite word t, fJ,a,y k (at) = /J. ayk {t). Hence /i«(at) = ^wvay ay 1 --y k _ 1 ay k {^)- By The- 
orem ED fj, ma = fJ-ayiS, for each i, so that /ift(at) = Hwva yo ay 1 a~-y k _ 1 ay k (t). From Theorem EH the 
epistandard morphism fj, va is strongly quasiperiodic. Thus //&(at) is quasiperiodic. Consequently 
//jj is strongly quasiperiodic on A. 

To end, we prove that if u verifies none of the conditions i)-iii), then there exists (at least) 
one word t such that /x#(t) is not quasiperiodic (and so is not strongly quasiperiodic). This is 
immediate if u = e. 

• Let us first consider the case where u ends with an L-spinned letter a, that is u = wa for some 
spinned word w. Since \A\ > 2, there exist m > 2 pairwise different letters a\ = a, a<i, ■ ■ ■ , a m 
such that A = {a%, 02, . . . , a m }. Let t be the episturmian word with normalized directive word 
(a2 • • • a m a) u '. Since wa is normalized, wa(a2 ■ • • flmo)^ is also normalized. Moreover since u 
does not verify condition i), the word wa(a~2 ■ ■ ■ fl m «) u cannot be decomposed into the form 
w'bv\bv2 ■ ■ ■ bv^y' ■ ■ ■ where b is an L-spinned letter, w' is a spinned word, y' is a spinned 
version of an L-spinned word y 1 , and v\,...,Vk (k > 0) are 6-free L-spinned words such 
that Alph(6?/) = Alph(bv'y'). By Remark 15.131 and Theorem 14.291 the word /i#(t) is not 
quasiperiodic. 

• Now we consider the case when u ends with an i?-spinned letter, that is u = wv for a non- 
empty L-spinned word v and a spinned word w such that w = e or w ends with an L-spinned 
letter. Two cases can hold: 

Case 1: A± Alph(u). 

Let a be a letter in A \ Alph(u), and let b be any other letter (remember \A\ > 2). 
Let t be the episturmian word with normalized directive word (ab)^ and let t' = /u„(t). 
Then t' is directed by ii' = wv{ab) u ' and, since wv is normalized and a Alph(u), 
ii' is normalized. Moreover since wv does not verify i), ii' cannot be decomposed into 
the form w'cv\cv2 ■ ■ ■ cv^y' ■ ■ ■ with an L-spinned letter c, a spinned word w', a spinned 
version y' of an L-spinned word y' and some c-free L-spinned words v\,...,vt (k > 0) 
such that Alph(o;fc) = A\ph.{cvj t y l ). By Remark 15.131 and Theorem 14.291 the word //fi(t) 
is not quasiperiodic. 

Case 2: A = Alph(u). 

Since u does not verify ii), there exists a letter a and a- free L-spinned words vo,...,Vk £ 
.4* such that u = wv^avi ■ ■ ■ avk and for all i, 1 < i < k, Alph(-Uj) 7^ A \ {a}. Moreover 
since u does not verify Hi), then either vq 7^ e, or w cannot be written in the form 
w = w'v' for a spinned word w and an L-spinned word v' such that Alph(v') = A \ {a}. 
Since A contains at least three letters, there exist m > 3 pairwise different letters a\ = a, 
0,2, ■ ■ ■ , a m such that A = {a\, 02, • • • , a m }. 

Let t be the episturmian word with normalized directive word (00203 • • • a m ) u and 
let t' = /U^(t). Then t' is directed by wv^avi ■■■ 0^00203 ••• a m (002 ••• a m ) w 
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ivvQav \a ■ ■ ■ avkaa2a3 ■ ■ ■ a m (00203 • • • a m ) aJ . Since w ends with an L-spinned letter and 
each Vi (0 < i < k) is a-free, this word is normalized. 

Since wv does not verify i) and from the previous observations, the word 
wvoavia ■ ■ ■ au^ 002^3 • • • a m (aa2 • • • a-mY cannot be decomposed into the form 
w'bv[bv2 ■ ■ ■ bvLy' ■ ■ ■ with an L-spinned letter b, a spinned word w' , a spinned 
version y' of an L-spinned word y' and some 6-free .L-spinned words v'±, . . . ,v' k (k > 0) 
such that with A\ph(bv' k ) = A\ph.(bv' k y'). By Remark 15.131 and Theorem 14.291 the word 
/Xjj(t) is not quasiperiodic. 

□ 

6 Episturmian Lyndon words 

Theorem 14.281 provides a characterization of quasiperiodic episturmian words. In the binary case, 
it was proved in [29] that a Sturmian word is quasiperiodic if and only if it is not an infinite 
Lyndon word. A natural question to ask is then: "does this result still hold for episturmian words 
on a larger alphabet?" By a result in [29], one can see that any infinite Lyndon word is non- 
quasiperiodic. In this section, we show that there is a much wider class of episturmian words 
that are non-quasiperiodic, besides those that are infinite Lyndon words. This follows from our 
characterization of episturmian Lyndon words (Theorem 16.11 to follow). 

Let us first recall the notion of lexicographic order and the definition of Lyndon words (see [30] 
for instance). 

Suppose the alphabet A is totally ordered by the relation <. Then we can totally order A* by 
the lexicographic order < defined as follows. Given two words u, v £ A + , we have u < v if and 
only if either u is a prefix of v or u = xav! and v = xbv' , for some x, u' , v' S „4* and letters a, 
b with a < b. This is the usual alphabetic ordering in a dictionary. We write u < v when u < v 
and u ^ v, in which case we say that u is (strictly) lexicographically smaller than v. The notion of 
lexicographic order naturally extends to infinite words in A u . We denote by min(^l) the smallest 
letter with respect to the lexicographic order. 

A non-empty finite word w over A is a Lyndon word if it is lexicographically smaller than all 
of its proper suffixes for the given order < on A. Equivalently, w is the lexicographically smallest 
primitive word in its conjugacy class; that is, w < vu for all non-empty words u, v such that w = uv. 
The first of these definitions extends to infinite words: an infinite word over A is an infinite Lyndon 
word if and only if it is (strictly) lexicographically smaller than all of its proper suffixes for the 
given order on A. That is, a finite or infinite word w is a Lyndon word if and only if w < T l (u;) 
for all i > 0. 

In this section, we assume that \A\ > 1 since on a 1-letter alphabet there are no infinite Lyndon 
words. Also note that an infinite Lyndon word cannot be periodic. Therefore we consider only 
aperiodic episturmian words (i.e., those with |Ult(A)| > 1). 

6.1 A complete characterization 

In this section, generalizing previous results in [29] (Sturmian case) and [15] (Arnoux-Rauzy se- 
quences or strict episturmian words), we prove: 

Theorem 6.1. Let A = {a\, . . . ,a m } be an alphabet ordered by a% < 02 < ■ ■ ■ < a m and, for 
1 < i < m, let B-i = {a^, . . . , a m }. 

An episturmian word w is an infinite Lyndon word if and only if there exists an integer j such 
that 1 < j < m and the (normalized) directive word of w belongs to: 
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Note. In the above theorem, we have put the word normalized between brackets since one can easily 
verify from Theorem 13,71 that a spinned infinite word of the given form is the unique directive word 
of exactly one episturmian word. 

Example 6.2. Let A = {a, b, c, d}. Then the word (bca)(dcb) 2 (dcc) UJ directs a Lyndon episturmian 
word, so does aa{dc) w , but cabadcd^ does not (this spinned word directs a periodic word). 

Remark 6.3. The "if and only if" condition can be reformulated as follows: 

The (normalized) directive word o/w takes the form v i • ■ ■ Vjy where: 

• for 1 < k < j, Vk is a spinned word in {B^ +1 ak)* ; 

• y is a spinned infinite word belonging to (&j~ +l {aj} + ) w . 

Remark 6.4. Theorems 13. 71 and 16.11 show that any episturmian Lyndon word has a unique spinned 
directive word, but the converse is not true. Certainly, there exist episturmian words with a unique 
directive word which are not infinite Lyndon words. For example, the regular wavy word (abc) u is 
the unique directive word of the strict episturmian word: 




= acabaabacabacabaabaca ■ ■ ■ 



which is clearly not an infinite Lyndon word by Theorem 16.11 and also by the fact that acabaaw is 
not a Lyndon word for any order on {a, b, c} and for any word w. 

In order to prove the Theorem [67TJ, we recall two useful results and state a new one (Lemma [62]). 

Lemma 6.5. [M] An infinite word is a Lyndon word if and only if it has infinitely many different 
Lyndon words as prefixes. 

A morphism / is said to preserve finite (resp. infinite) Lyndon words if for each finite (resp. in- 
finite) Lyndon word w, f(w) is a finite (resp. infinite) Lyndon word. For episturmian morphisms, 
we have: 

Proposition 6.6. [421 [45] Let A = {a\, . . . , a m } be an alphabet ordered by a\ < 02 < • ■ ■ < a m . 

Then the following assertions are equivalent for an episturmian morphism: 

• / preserves finite Lyndon words; 

• / preserves infinite Lyndon words; 

• /e(^| a2) ... j0ro} V)*{^a m r. 

Now we prove a lemma concerning the action of morphisms in Lj^ U 7^4 on infinite Lyndon 
words. 

Lemma 6.7. Suppose w is an infinite word over an ordered alphabet A and let a, b G A with a < b. 
Then, the following properties hold. 

i) LbfL a (w) is not an infinite Lyndon word for any non-erasing morphism f. 

ii) R a fLb(w) is not an infinite Lyndon word for any morphism f in 7£V 

Hi) Lf w is recurrent, then R x fL x (w) is not an infinite Lyndon word for any letter x and mor- 
phism f in TZ*a- 
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Proof, i) The infinite word LbfL a (w) starts with b and contains an occurrence of the letter a; thus, 
since a < b, it cannot be an infinite Lyndon word. 

ii) As / € T^*ai the infinite word R a fLf 1 (w) starts with b and contains an occurrence of the letter 
a; thus, since a < b, it cannot be an infinite Lyndon word. 

Hi) To be an infinite Lyndon word, R x fL x (w) must be aperiodic, in which case a letter different 
from x occurs in it; in particular this letter occurs in fL x (w). Moreover, as / G 7^, the infinite 
word fL x (w) begins with x and hence with a prefix x n y for some integer n > 1 and a letter 
y ^ x. The recurrence of the infinite word w implies the recurrence of fL x (w), and so fL x (w) 
contains a factor zx n+r y for some letter z ^ x and integer r > 0. Now R x fL x (w) begins with 
x n y and contains zx n+r+1 y, and so contains x n+1 . To be an infinite Lyndon word, it needs x = 
min(Alph(i? x /L a; (w))), but then x n+l < x n y. Thus R x fL x (w) is not an infinite Lyndon word. □ 

Lastly, we need an important easy fact: 

Fact 6.8. |45j Any morphism / in £4 U 7^4 preserves the lexicographic order for infinite words. 
More precisely, for any infinite words w and w', w < w' if and only if /(w) < /(w'). 

A consequence of the above fact is that for any word w and for any morphism / in £4 U TZa, if 
/(w) is a Lyndon word then necessarily w is also a Lyndon word. 

Proof of Theorem \6.1\ Assume that A is the normalized directive word of a Lyndon episturmian 
word. Then A contains no factor of the form xvx for any letter x and v G A*. By Lemma 16.71 it 
does not contain any factor of the form bva or avb with v £ A* and a < b. Thus A takes the form 
given in the statement of the theorem. Indeed by item i) of Lemma 16.71 and by Fact 16.81 only one 
letter (namely ctj) can have all spins ultimately L. Since a Lyndon word is not periodic, at least 
one other letter in A should occur infinitely often. By items of Lemma 16.71 such a letter 

should belong to Bj+i. Moreover, the sequence of letters with spin L must be order-increasing and 
items of Lemma 16.71 determine the conditions on letters with spin R. 

Conversely, suppose that the (normalized) directive word A of the episturmian word w takes 
the form given in the statement of the theorem. Write A = v\ ■ ■ ■ Vj~y where, for 1 < k < j, Vk is 
a spinned word in (B^ +l ak)* and y is a spinned infinite word belonging to {B^ +1 {aj} + ) w . Then, 
because of the recurrence of the letter a. and of at least one other letter in £>j+i in y, there exists 
a sequence of spinned words (v n ) n >j, with each v n in 8j~ +l {aj} + , such that y = Yl n >j v n- Now, for 
each k > 1, fi Vk is a Lyndon morphism on Bk by Proposition I6.6L Hence, for each k > 1, the word 
fi Vl ... Vk (aj) is a Lyndon word. From w = limfc_^ 0O /i„ 1 ...„ fc (aj), we deduce from Lemma 16.51 that the 
episturmian word w is an infinite Lyndon word. □ 

6.2 Strict episturmian Lyndon words 

Let us recall from [23] that an epistandard word s, or any episturmian word in the subshift of s, 
is said to be ^4-strict if its L-spinned directive word A verifies Ult(A) = A. For these words, also 
called Arnoux-Rauzy sequences [5], Theorem 16.11 gives: 

Corollary 6.9. Let A = {cq, . . . , a m } be an alphabet ordered by a\ < 02 < ■ • • < a m . An A-strict 
episturmian word w is an infinite Lyndon word if and only if the (normalized) directive word of w 
belongs to {cq, a%, ... , a m } w . 

This can be reformulated as a generalization of Proposition 6.4 in |29j : 

Corollary 6.10. [15] An A-strict episturmian word t is an infinite Lyndon word if and only if it 
can be infinitely decomposed over the set of morphisms {L a ,R x \ x £ A \ {a}} where a = min(^4) 
for the given order on A. 
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The above result also follows from the following generalization of a result on Sturmian words 
given by Borel and Laubie [8] (see also |43j ) . 

Theorem 6.11. An A-strict episturmian word t is an infinite Lyndon word if and only ift = as 
where a = min(A) for the given order on A and s is an (aperiodic) A-strict epistandard word. 
Moreover, if A is the L-spinned directive word of s, then t = as is the unique episturmian word in 
the subshift of s directed by the spinned version of A having all spins R, except when Xi = a. 

The proof of the above theorem requires the following result that is essentially Theorem 3.17 
from [23], apart from the fact that as is in the subshift of s, which follows from Fact 12. 3[ 

Theorem 6.12. Suppose s is an epistandard word directed by A = £1X2X3 • • • and let a be a 
letter. Then as is an episturmian word if and only if a € Ult(A), in which case as is the unique 
episturmian word in the subshift of s directed by the spinned version of A having all spins R, except 
when Xi = a. 

Proof of Theorem \6.11[ Let A = {ai, 02, . . . , a m } with a\ < 02 < ■ ■ ■ < a m . By Corollary 16.91 an 
A-strict episturmian word t is an infinite Lyndon word if and only if the (normalized) directive word 
A of t belongs to {ai,a2, ■ ■ ■ ,a m }", i.e., if and only if t = ais where s is the unique epistandard 
word directed by the L-spinned version of A, by Theorem 16. 121 □ 

Note. If s is an epistandard word over A, then as is an infinite Lyndon word for any order such 
that a = min(A). 

Let us point out that completely different proofs of Corollary 16.91 and Theorem 16.111 using a 
characterization of episturmian words via lexicographic orderings, were given in [15] by the first 
author. A refinement of one of the main results in [TB] is also given in |15| . 

To end, let us observe that, contrary to the fact that there exists |A|! possible orders of a finite 
alphabet A, Theorem 16.111 shows that there exist exactly |A| infinite Lyndon words in the subshift 
of a given .A-strict epistandard word s, when |A| > 1 (since there are no Lyndon words when 
|A| = 1). That is, for any order with min(A) = a, the subshift of s contains a unique infinite 
Lyndon word beginning with a, namely as. 

Example 6.13. With A = (abed)" , the spinned versions (abed)" , (abed)", (abed)", (abed)", 
(abed)", (abed)", (abed)" and their opposites direct non-quasiperiodic episturmian words in the 
subshift of the 4-bonacci word z. Only the first four of these words direct Lyndon episturmian 
words: az, bz, cz, dz, respectively. 

7 Concluding remarks 

In [35], Monteil proved that any Sturmian subshift contains a multi-scale quasiperiodic word, i.e., 
an infinite word having infinitely many quasiperiods. A shorter proof of this fact was provided in 
|29] . This can be easily extended to episturmian words. Certainly, by Fact 13 .4] any episturmian 
subshift contains at most two epistandard words (one in the aperiodic case and two in the periodic 
case) and any epistandard word has infinitely many quasiperiods (by Theorem 14. 10p . 

Actually the characterization of quasiperiodic Sturmian words in [29] shows that in any Sturmian 
subshift there are only two non-quasiperiodic Sturmian words and all other (Sturmian) words in 
the subshift have infinitely many quasiperiods. It is easy to see that the same result does not hold 
for episturmian words defined over an alphabet containing more than two letters. For instance, 
any episturmian word having a spinned directive word in {abc, abc}" is non-quasiperiodic: all of 
these non-quasiperiodic episturmian words belong to the subshift of the Tribonacci word r, directed 
by (abc)". Moreover, one can verify (using Theorem I3.2p that the quasiperiodic episturmian word 
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t directed by (abc) n (ab^^abc) 011 for some n > 1 (which is in the subshift of r) has exactly n + 1 
directive words: 

(aft^^ft^a^^-^aSc) 1 ^, < t < n. 

Hence it is clear from Theorem 14,191 that t has only finitely many quasiperiods. (See also Exam- 
ples 
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